Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

Releases: Anemll/Anemll

0.3.4 Alpha Release

07 Jul 22:38
6ecb10d

Choose a tag to compare

0.3.4 Alpha Release Pre-release
Pre-release

πŸ”„ What's New in 0.3.4

πŸ“Š lm-evaluation-harness Support - Model evaluation with standard benchmarks (BoolQ, ARC Challenge, etc.) - Documentation
🎯 New RMSNorm Implementation - Precise calculation with ANE hardware ops
πŸ› Fixed RoPE Tensor Size Bug - Resolved random overflows (existing pre-0.3.4 models should be re-converted)

####Example ANE vs HF on MPS backend ( Qwen 3 0.6B )

Task HF-FP16 ANEMLL-FP16 DIFF %
arc_challenge 31.66% 30.97% -0.69%
arc_easy 60.65% 60.94% +0.29%
boolq 63.91% 64.68% +0.77%
piqa 66.81% 67.74% +0.93%
winogrande 56.43% 56.67% +0.24%
Average 55.89% 56.60% +0.71%

0.3.3 Alpha Release

19 Jun 19:16

Choose a tag to compare

0.3.3 Alpha Release Pre-release
Pre-release

πŸš€ ANEMLL 0.3.3 Alpha Release

πŸ†• New Features

  • 🎯 Qwen 3 Architecture Support - Initial implementation for Qwen3 (0.6B, 1.7, 4B) models
  • πŸ“¦ Streamlined Installation - One-command setup with ./create_python39_env.sh +
    ./install_dependencies.sh
  • πŸ§ͺ Automated Testing Framework - End-to-end validation with python tests/test_qwen_model.py
  • πŸ”§ Enhanced Developer Experience - Improved error handling and better feedback

v0.3.0-alpha

12 Apr 17:08

Choose a tag to compare

v0.3.0-alpha Pre-release
Pre-release

Sample iOS/macOS inference Chat-Bot App (Alpha)
Updates to Model conversion and upload scripts
Updates to Swift Package and CLI App

0.1.2-alpha

16 Feb 19:40
d3b2e36

Choose a tag to compare

0.1.2-alpha Pre-release
Pre-release
  • Dependency checks and troubleshooting guide docs/troubleshooting.md added
  • Prefill batch size added to the conversion script
  • Chat_full interface updated for DeepHermes "think" token "/t" and both chat interfaces added --nw flag to skip warmup step ( docs/chat.md )
  • XCode Tools dependency added to the README

0.1.1-alpha

10 Feb 15:04

Choose a tag to compare

0.1.1-alpha Pre-release
Pre-release

Release 0.1.1-alpha

  • Single-shot model conversion with see convert_model.md
  • Simplified model configuration with meta.yaml
  • Automated Hugging Face distribution preparation with prepare_hf.sh
  • Enhanced Chat Interfaces with better error handling and configuration support
  • Improved LLaMA model with prefill optimization

Alpha Release 0.1.0

09 Feb 04:18

Choose a tag to compare

Alpha Release 0.1.0 Pre-release
Pre-release

Initial Support for LLama and Distill DeepSeek for Apple Neural Engine