Age | Commit message | Author
2024-08-19 | Bump version to 0.0.6 | Ashwin Bharambe
2024-08-19 | Move code from llama3_1 -> llama3 | Ashwin Bharambe
2024-08-15 | Add `WebMethod.method` for overriding the HTTP method for a route | Ashwin Bharambe
2024-08-15 | updating tool call formats in llama cli (#110) | Hardik Shah
2024-08-15 | note inclusion of prev Q&A pairs in prompt for SQuAD eval in eval_details.md ... | Jason Krone
2024-08-14 | update cli system prompt templates | Hardik Shah
2024-08-14 | Update README.md for llama download (#105) | Dalton Flanagan
2024-08-14 | Add example scripts to show how to run the model (#108) | Ashwin Bharambe
2024-08-13 | Bump version to 0.0.5 | Hardik Shah
2024-08-13 | added micro avg metrics for MMLU-Pro to eval_details.md (#106) | Kai Wu
2024-08-13 | update system prompt to drop newline | Hardik Shah
2024-08-12 | upgrade pydantic to latest | Hardik Shah
2024-08-08 | Fix n_kv_heads for mp16 models | Ashwin Bharambe
2024-08-08 | Bump version to 0.0.4 | dltn
2024-08-08 | Bump version to 0.0.3 | Ashwin Bharambe
2024-08-08 | Merge distros to main (#97) | Dalton Flanagan
2024-08-07 | Improve model list display (#94) | Dalton Flanagan
2024-08-07 | SKUs: chat models categorized as instruct models | dltn
2024-08-07 | Add Llama 2 and Llama 3 SKUs | dltn
2024-08-07 | Add checklist.chk to files which are downloaded | Ashwin Bharambe
2024-08-07 | parse json for tool calls properly | Hardik Shah
2024-08-06 | mmlu_pro metric should be macro_avg/acc instead of micro_avg/acc_char (#92) | Kai Wu
2024-08-05 | test for custom tool calling | Hardik Shah
2024-07-31 | Change how we name SKUs; add safety SKUs (#79) | Ashwin Bharambe
2024-07-31 | Add missing fp8_scales files to `meta-llama-3.1-405b-fp8` (#69) | pmysl
2024-07-31 | fix: chmod +x download.sh (#72) | Matthew Berryman
2024-07-31 | Remove md5sum requirement (#74) | minimalic
2024-08-01 | Create LICENSE (#84) | Joseph Spisak
2024-07-31 | Update README.md | Joseph Spisak
2024-07-31 | Update README.md | raghotham
2024-07-30 | move model SKU stuff upwards into top-level llama_models namespace | Ashwin Bharambe
2024-07-30 | Hard code consolidated.XX.pth file sizes for 405B due to Cloudfront | Ashwin Bharambe
2024-07-29 | Bump version to 0.0.2 | Ashwin Bharambe
2024-07-29 | add recommended sampling params | Ashwin Bharambe
2024-07-29 | add a mapping from SKU -> llamameta.net subfolders | Ashwin Bharambe
2024-07-29 | make bf16 default quantization format | Ashwin Bharambe
2024-07-29 | Fill up the SKU list for llama models | Ashwin Bharambe
2024-07-29 | Add PyYAML dependency | Ashwin Bharambe
2024-07-27 | Add links to shields | Dalton Flanagan
2024-07-27 | Add shields to README | Dalton Flanagan
2024-07-26 | Update templates | Ashwin Bharambe
2024-07-25 | updated the Huggingface evals link (#53) | Kai Wu
2024-07-25 | download.sh to use curl and chunk downloads of 405b models (#37) | Samuel Selvan
2024-07-24 | Update README.md (#33) | Nibesh Kiran Adhikari
2024-07-24 | fix model_card.md (#36) | Kai Wu
2024-07-23 | Update eval details (#23) | Rohit Patel
2024-07-23 | Fixing meta-llama-3.1-405b-instruct-fp8 (#22) | Samuel Selvan
2024-07-23 | Add link to PyPI and a small blurb about pip install | Ashwin Bharambe
2024-07-23 | Standardize URLs (#18) | Mario Apra
2024-07-23 | Update download.sh (#17) | wysuperfly