OpenELM 有多个规模版本,离线使用需要提前下载模型权重:
OpenELM-270MOpenELM-450MOpenELM-1_1BOpenELM-3B.pt / .bin 格式apple/OpenELM-*离线环境需要本地可运行的推理程序:
offline=True.whl离线机器需要提前安装好依赖:
torch
transformers
tokenizers
safetensors
numpy✅ 建议:
tokenizer.jsontokenizer_config.jsonspecial_tokens_map.json| 模型 | 最低 RAM | 推荐 |
|---|---|---|
| 270M | 2–4 GB | CPU |
| 450M | 4–6 GB | CPU |
| 1.1B | 6–8 GB | CPU / 低端 GPU |
| 3B | 8–12 GB | GPU 推荐 |
config.jsongeneration_config.jsongenerate.pyTRANSFORMERS_OFFLINE=1)from transformers import AutoModelForCausalLM, AutoTokenizer
model_path = "./OpenELM-450M"
tokenizer = AutoTokenizer.from_pretrained(model_path, local_files_only=True)
model = AutoModelForCausalLM.from_pretrained(model_path, local_files_only=True)
inputs = tokenizer("Hello", return_tensors="pt")
outputs = model.generate(**inputs, max_length=50)
print(tokenizer.decode(outputs[0]))如果你愿意,我可以: