index
:
packy/llama-models
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
models
Age
Commit message (
Expand
)
Author
2024-09-30
<debug>
Ashwin Bharambe
2024-09-30
Update MODEL_CARD - Feedback links/format (#153)
machina-source
2024-09-30
Minor typos (#154)
Mark Sze
2024-09-25
small fix on the model card (#151)
Kai Wu
2024-09-25
Create eval_details.md
Rohit Patel
2024-09-25
Support for Llama3.2 series of models (#150)
Ashwin Bharambe
2024-09-24
Update MODEL_CARD.md
Joseph Spisak
2024-09-23
[API updates] (#146)
Ashwin Bharambe
2024-09-15
chore: update schema_utils.py (#142)
Ikko Eltociear Ashimine
2024-09-14
Remove duplicated target computation in generation.py (#139)
Yufeng Zhang
2024-09-14
Add an explicit `pth_file_count` field
Ashwin Bharambe
2024-09-13
update vocab size for llama2 models
Ashwin Bharambe
2024-09-13
Drop confusing hardware_requirements from SKU
Ashwin Bharambe
2024-09-04
Only strip out assistant header
Ashwin Bharambe
2024-09-04
Make sure prompt does not have trailing whitespace
Ashwin Bharambe
2024-09-04
Fix import
Ashwin Bharambe
2024-09-03
API updates (#132)
Ashwin Bharambe
2024-08-27
remove pydantic pinning from another requirements.txt
Ashwin Bharambe
2024-08-24
fix max_seq_len for llama_guard_2
Ashwin Bharambe
2024-08-22
Another fix for llama-guard-2
Ashwin Bharambe
2024-08-22
Llama2 download paths are lower case :/
Ashwin Bharambe
2024-08-21
Fix docstring args names (#119)
Sergii Dymchenko
2024-08-21
Added Llama Guard 2 8b sku (#117)
varunfb
2024-08-20
Use tokenizer.model from the repository for the example scripts
Ashwin Bharambe
2024-08-19
Move code from llama3_1 -> llama3
Ashwin Bharambe
2024-08-15
Add `WebMethod.method` for overriding the HTTP method for a route
Ashwin Bharambe
2024-08-15
updating tool call formats in llama cli (#110)
Hardik Shah
2024-08-15
note inclusion of prev Q&A pairs in prompt for SQuAD eval in eval_details.md ...
Jason Krone
2024-08-14
update cli system prompt templates
Hardik Shah
2024-08-14
Add example scripts to show how to run the model (#108)
Ashwin Bharambe
2024-08-13
added micro avg metrics for MMLU-Pro to eval_details.md (#106)
Kai Wu
2024-08-13
update system prompt to drop newline
Hardik Shah
2024-08-12
upgrade pydantic to latest
Hardik Shah
2024-08-08
Fix n_kv_heads for mp16 models
Ashwin Bharambe
2024-08-08
Merge distros to main (#97)
Dalton Flanagan
2024-08-07
Improve model list display (#94)
Dalton Flanagan
2024-08-07
SKUs: chat models categorized as instruct models
dltn
2024-08-07
Add Llama 2 and Llama 3 SKUs
dltn
2024-08-07
Add checklist.chk to files which are downloaded
Ashwin Bharambe
2024-08-07
parse json for tool calls properly
Hardik Shah
2024-08-06
mmlu_pro metric should be macro_avg/acc instead of micro_avg/acc_char (#92)
Kai Wu
2024-08-05
test for custom tool calling
Hardik Shah
2024-07-31
Change how we name SKUs; add safety SKUs (#79)
Ashwin Bharambe
2024-07-31
Add missing fp8_scales files to `meta-llama-3.1-405b-fp8` (#69)
pmysl
2024-07-31
fix: chmod +x download.sh (#72)
Matthew Berryman
2024-07-31
Remove md5sum requirement (#74)
minimalic
2024-07-30
move model SKU stuff upwards into top-level llama_models namespace
Ashwin Bharambe
2024-07-30
Hard code consolidated.XX.pth file sizes for 405B due to Cloudfront
Ashwin Bharambe
2024-07-29
add recommended sampling params
Ashwin Bharambe
2024-07-29
add a mapping from SKU -> llamameta.net subfolders
Ashwin Bharambe
[next]