mosure/burn_dragon_hatchling

burn_dragon_hatchling 🔥🐉🐣

burn inference and training of the baby dragon hatchling model

features

  • cached inference
  • training benchmarks and reporting
  • adaptive tool discovery
  • conditional (deep) gating
  • document-coherent dataloading and scale mixup
  • episodic memory
  • fused kernels
  • hierarchical, memory-aware recurrent state
  • mixture-of-expert routing
  • multi-modal architecture
  • multi-stream truncated backpropagation through time
  • neuromorphic backend
  • rl reasoning training
  • sparsity metrics and visualization
  • streaming, sparse synaptic backpropagation
  • temporal neuron dampening
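As a rough illustration of the "sparsity metrics" item above, activation sparsity is commonly reported as the fraction of entries in an activation vector that are exactly zero, for example after a ReLU. The sketch below is plain Rust with no burn dependency. The names `activation_sparsity` and `relu` are hypothetical and are not taken from this crate, whose actual metrics may be defined differently.

```rust
/// Fraction of entries that are exactly zero.
/// Hypothetical helper for illustration, not this crate's API.
/// Returns 0.0 for an empty slice to avoid dividing by zero.
pub fn activation_sparsity(activations: &[f32]) -> f32 {
    if activations.is_empty() {
        return 0.0;
    }
    let zeros = activations.iter().filter(|&&a| a == 0.0).count();
    zeros as f32 / activations.len() as f32
}

/// Apply ReLU so that negative pre-activations become exact zeros.
pub fn relu(pre: &[f32]) -> Vec<f32> {
    pre.iter().map(|&x| x.max(0.0)).collect()
}

fn main() {
    // Two of the four pre-activations are negative and get zeroed by ReLU.
    let pre_activations = [-1.5_f32, 0.3, -0.2, 2.0];
    let acts = relu(&pre_activations);
    println!("sparsity = {}", activation_sparsity(&acts));
}
```

A higher value means fewer neurons are active for a given input. Averaging this quantity over a batch or over training steps is one simple way to track it over time.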

Dataset configuration (built-in presets and Hugging Face examples) is documented inline in config/base.toml.

training

  • cargo run --release (defaults to the cuda backend)

inference

  • cargo run --bin infer -- --max-tokens 2048 --streaming
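The flags above suggest a generation loop that stops after a token budget and emits tokens as they are produced rather than all at once at the end. Below is a minimal sketch of that pattern in plain Rust, assuming nothing about this crate's internals: `generate_streaming`, `next_token`, and `on_token` are hypothetical names, and the toy closure stands in for a real model step.

```rust
/// Generate up to `max_tokens` tokens, handing each one to `on_token` as soon
/// as it is produced. `next_token` returns `None` to signal end of sequence.
/// All names here are hypothetical, not this crate's API.
pub fn generate_streaming<N, S>(
    prompt: &[u32],
    max_tokens: usize,
    mut next_token: N,
    mut on_token: S,
) -> Vec<u32>
where
    N: FnMut(&[u32]) -> Option<u32>,
    S: FnMut(u32),
{
    let mut context = prompt.to_vec();
    let mut generated = Vec::new();
    while generated.len() < max_tokens {
        match next_token(&context) {
            Some(tok) => {
                on_token(tok); // stream the token out immediately
                context.push(tok); // feed it back in for the next step
                generated.push(tok);
            }
            None => break, // the model signalled end of sequence
        }
    }
    generated
}

fn main() {
    // Toy "model": emits the last token plus one and stops after reaching 5.
    let toy_model = |ctx: &[u32]| {
        let next = ctx.last().copied().unwrap_or(0) + 1;
        if next > 5 { None } else { Some(next) }
    };
    let out = generate_streaming(&[1], 2048, toy_model, |t| print!("{t} "));
    println!();
    assert_eq!(out, vec![2, 3, 4, 5]);
}
```

Passing the sink as a closure keeps the loop independent of where tokens go (stdout, a channel, a socket). A non-streaming caller can pass a no-op sink and use the returned vector instead.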

benchmarks

  • cargo bench (executes both wgpu and cuda benchmarks)
  • open target/criterion/report/index.html

compatible burn versions

burn_dragon_hatchling | burn
--------------------- | ----
0.1                   | 0.18

license

licensed under either of

  • Apache License, Version 2.0 (LICENSE-APACHE)
  • MIT license (LICENSE-MIT)

at your option.

contribution

unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.


Stars

Watchers

Forks

Releases

No releases published

Languages