SpeakMath: Natural Expressions into Verified Computations

Running the Web Interface (Chatbot)

This project includes an educational chatbot interface that visualizes the compilation pipeline (Lexer, Parser, Interpreter).

Install dependencies:
```
pip install -r requirements.txt
```

Run the Streamlit app:

python -m streamlit run streamlit_app.py

Open the link displayed in the terminal (usually http://localhost:8501).

Running the CLI Demo

SpeakMath is a math-focused natural language mini-programming language that interprets expressions like "find the mean of these values" into verified computations. The LLM suggests operator meanings, while our grammar verifies expressions before evaluation.

University of Malaya | Faculty of Computer Science & Information Technology
WIF3010: Programming Language Concepts | Project Brief 2025

📖 Table of Contents

🤖 About the Project

Project Title: SpeakMath (Topic #3 from WIF3010 Brief)

Core Concept:
Create a math-focused natural mini-language where:

Users write commands like "find the mean of these values"
LLM suggests operator meanings (e.g., "average" → mean)
Our grammar verifies expressions before evaluation
Execution is handled by our own interpreter

Paradigm Extension: Functional Programming (map/reduce/composition)

Why SpeakMath?

Makes mathematical operations accessible through natural language
Combines formal grammar verification with LLM flexibility
Perfect for demonstrating functional programming concepts
Clear scope for proof of correctness

🏗 System Architecture

Updated Architecture with LLM Fallback

graph TB
    A[User Input] --> B[Lexer]
    B --> C[Parser]
    C --> D{Token Known?}
    D -- "MAP/REDUCE<br/>(Grammar-First)" --> E[Parse Map/Reduce]
    D -- "Known Op<br/>(SUM/MEAN/etc)" --> F[Semantic Map]
    D -- "Unknown Verb" --> G[Build Phrase]

    E --> H[AST Builder]
    F --> H
    G --> I{LLM Resolver}

    I -- "SEMANTIC_MAP" --> J[Return Operator]
    I -- "SYNONYM_MAP" --> J
    I -- "Heuristics" --> J
    I -- "LLM API" --> K{Found?}

    K -- "Yes + Metadata" --> J
    K -- "UNKNOWN" --> L[Structured Error]

    J --> H
    L --> M[Error Output]

    H --> N[Interpreter]
    N --> O{Is LLM Resolved?}
    O -- "Yes" --> P[Log Metadata]
    O -- "No" --> Q[Direct Execute]
    P --> Q
    Q --> R[Output]

    style E fill:#90EE90
    style F fill:#90EE90
    style I fill:#FFD700
    style L fill:#FF6B6B
    style P fill:#87CEEB

Legend:

🟢 Green: Grammar-First (No LLM)
🟡 Yellow: LLM Fallback Layer
🔴 Red: Error Handling
🔵 Blue: Metadata Logging

Flow Summary:

Lexer: Tokenizes input (MAP/REDUCE = keywords)
Parser: Matches grammar or builds phrase for unknown verbs
LLM Resolver: 4-layer strategy (Map→Synonym→Heuristic→API)
AST Builder: Creates nodes with LLM metadata
Interpreter: Executes and logs LLM resolutions

See: Complete Architecture Diagram

🎯 LLM Integration Rules

✅ LLM CAN:

Map unknown verb phrases to canonical operators
Provide reasoning for operator choices
Return "UNKNOWN" for unsupported operations
Resolve operation names within map/reduce

❌ LLM CANNOT:

Override grammar rules
Change token types or syntax structure
Generate AST nodes
Handle MAP/REDUCE keywords (always grammar-first)
Execute code

See: LLM Fallback Design

📜 Grammar Design (Week 8)

Syntax Definition

The formal BNF/EBNF syntax definition is available in docs/syntax_definition.md.

Implemented Extensions:

✅ Variable assignment: set x to 5
✅ Conditionals: if x > 10 then print x
✅ Functional ops: map add 2 over [1, 2, 3]

LLM Fallback System

NEW: SpeakMath now includes a sophisticated LLM fallback system that supplements the grammar without overriding it.

Key Features:

🎯 Grammar-First: MAP/REDUCE and known operations always use grammar
🤖 LLM Supplement: Unknown verbs resolved via AI (e.g., "tally up" → sum)
📊 Structured Errors: Detailed failure objects with suggestions
📝 Metadata Tracking: All LLM resolutions logged with reasoning

Documentation:

📘 LLM Fallback Design - Complete design document
📊 Architecture Diagram - System flow diagrams
⚡ Quick Reference - Fast lookup guide
📋 Implementation Summary - Task completion summary

Example:

# Grammar-First (No LLM):
Input: "sum [1, 2, 3]"
Output: 6

# LLM Fallback (AI Resolves Unknown Verb):
Input: "tally up [1, 2, 3]"
LLM: "tally up" → OP_SUM (Reasoning: "tally implies summation")
Output: 6 (with log: "ℹ️ LLM Resolution: 'tally up' → SUM")

# Grammar-First Guarantee (MAP always parsed by grammar):
Input: "map add 2 over [1, 2, 3]"
Grammar: Handles structure, LLM only resolves "add" if needed
Output: [3, 4, 5]

Example Commands

sum 1, 2, 3, 4, 5
mean 10, 20, 30, 40
multiply 5 and 6
tally up [10, 20, 30]        # LLM-resolved
map increment 2 over [1,2,3]  # Grammar-first structure
reduce product on [2,3,4]     # Grammar-first

Sample Parse Tree (Week 8 Deliverable)

Command: "sum 1, 2, 3"

    <command>
        |
    ____|____
   |         |
<operation> <expression>
   |            |
  "sum"    1, 2, 3

👥 Team Roles

Role	Name	Responsibilities
Language Architect	[Insert Name]	Design BNF/EBNF grammar, parse trees
Programmer/Integrator	[Insert Name]	Build lexer/parser, system integration
Semantics & Proof Specialist	[Insert Name]	Semantic mapping, correctness proofs
LLM Integration Engineer	[Insert Name]	LLM API integration, synonym resolution
Runtime & Execution Engineer	[Insert Name]	Execution engine, functional paradigm

📅 Current Progress

Week 5 ✅

Team formation
Project proposal submitted
Group contract signed
Selected "SpeakMath" as project title

Week 8 � (Completed)

Finalize BNF/EBNF grammar
Create sample parse trees
Design semantic mapping table
Implement LLM fallback system
Create comprehensive architecture diagrams
Deliverable 2: Presentation 1 (Grammar + Architecture)

Week 8+ (NEW: LLM Integration)

Implement structured failure objects
Update AST with LLM-resolved nodes
Create grammar-first enforcement tests
Document LLM intervention rules
Build 4-layer resolution strategy

🎯 Next Steps

Immediate (Week 8)

Complete formal grammar definition
Design 3-5 parse tree examples
Create high-level architecture diagram
Prepare Presentation 1

Week 8 (After Presentation)

Start implementing lexer
Build basic parser
Test with simple commands

Week 10

Integrate LLM API
Implement functional features (map/reduce)
Prepare Presentation 2

📊 Assessment Focus

Criteria	Marks	Current Status
Proposal & Grammar Design	15	🟡 In Progress
Parser & Interpreter	20	⏳ Pending
LLM Integration	15	⏳ Pending
Paradigm Extension (Functional)	10	⏳ Pending
Proof of Correctness	10	⏳ Pending
Testing & Evaluation	10	⏳ Pending
Presentations	10	🟡 Preparing
Final Report	10	⏳ Pending

📚 Quick References

Required for Week 8:

BNF/EBNF grammar specification
3-5 example parse trees
High-level system architecture
Semantic mapping plan
LLM integration strategy

Tools We'll Use:

Python 3.8+ (interpreter)
OpenAI API or Gemini (LLM layer)
PLY or recursive-descent parser
GitHub (collaboration)

University of Malaya | FCSIT
Last Updated: Week 10, 2025

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
docs		docs
src		src
tests		tests
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
USER_GUIDE.md		USER_GUIDE.md
requirements.txt		requirements.txt
streamlit_app.py		streamlit_app.py
test_output.txt		test_output.txt
test_results.txt		test_results.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpeakMath: Natural Expressions into Verified Computations

Running the Web Interface (Chatbot)

Running the CLI Demo

📖 Table of Contents

🤖 About the Project

Why SpeakMath?

🏗 System Architecture

Updated Architecture with LLM Fallback

🎯 LLM Integration Rules

✅ LLM CAN:

❌ LLM CANNOT:

📜 Grammar Design (Week 8)

Syntax Definition

LLM Fallback System

Example Commands

Sample Parse Tree (Week 8 Deliverable)

👥 Team Roles

📅 Current Progress

Week 5 ✅

Week 8 � (Completed)

Week 8+ (NEW: LLM Integration)

🎯 Next Steps

Immediate (Week 8)

Week 8 (After Presentation)

Week 10

📊 Assessment Focus

📚 Quick References

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

nijam001/SpeakMath

Folders and files

Latest commit

History

Repository files navigation

SpeakMath: Natural Expressions into Verified Computations

Running the Web Interface (Chatbot)

Running the CLI Demo

📖 Table of Contents

🤖 About the Project

Why SpeakMath?

🏗 System Architecture

Updated Architecture with LLM Fallback

🎯 LLM Integration Rules

✅ LLM CAN:

❌ LLM CANNOT:

📜 Grammar Design (Week 8)

Syntax Definition

LLM Fallback System

Example Commands

Sample Parse Tree (Week 8 Deliverable)

👥 Team Roles

📅 Current Progress

Week 5 ✅

Week 8 � (Completed)

Week 8+ (NEW: LLM Integration)

🎯 Next Steps

Immediate (Week 8)

Week 8 (After Presentation)

Week 10

📊 Assessment Focus

📚 Quick References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages