Releases: Anemll/Anemll
Releases Β· Anemll/Anemll
0.3.4 Alpha Release
π What's New in 0.3.4
π lm-evaluation-harness Support - Model evaluation with standard benchmarks (BoolQ, ARC Challenge, etc.) - Documentation
π― New RMSNorm Implementation - Precise calculation with ANE hardware ops
π Fixed RoPE Tensor Size Bug - Resolved random overflows (existing pre-0.3.4 models should be re-converted)
####Example ANE vs HF on MPS backend ( Qwen 3 0.6B )
| Task | HF-FP16 | ANEMLL-FP16 | DIFF % |
|---|---|---|---|
| arc_challenge | 31.66% | 30.97% | -0.69% |
| arc_easy | 60.65% | 60.94% | +0.29% |
| boolq | 63.91% | 64.68% | +0.77% |
| piqa | 66.81% | 67.74% | +0.93% |
| winogrande | 56.43% | 56.67% | +0.24% |
| Average | 55.89% | 56.60% | +0.71% |
0.3.3 Alpha Release
π ANEMLL 0.3.3 Alpha Release
π New Features
- π― Qwen 3 Architecture Support - Initial implementation for Qwen3 (0.6B, 1.7, 4B) models
- π¦ Streamlined Installation - One-command setup with ./create_python39_env.sh +
./install_dependencies.sh - π§ͺ Automated Testing Framework - End-to-end validation with python tests/test_qwen_model.py
- π§ Enhanced Developer Experience - Improved error handling and better feedback
v0.3.0-alpha
Sample iOS/macOS inference Chat-Bot App (Alpha)
Updates to Model conversion and upload scripts
Updates to Swift Package and CLI App
0.1.2-alpha
- Dependency checks and troubleshooting guide docs/troubleshooting.md added
- Prefill batch size added to the conversion script
- Chat_full interface updated for DeepHermes "think" token "/t" and both chat interfaces added --nw flag to skip warmup step ( docs/chat.md )
- XCode Tools dependency added to the README
0.1.1-alpha
Release 0.1.1-alpha
- Single-shot model conversion with see convert_model.md
- Simplified model configuration with meta.yaml
- Automated Hugging Face distribution preparation with prepare_hf.sh
- Enhanced Chat Interfaces with better error handling and configuration support
- Improved LLaMA model with prefill optimization
Alpha Release 0.1.0
Initial Support for LLama and Distill DeepSeek for Apple Neural Engine