This project is a Telegram bot that classifies news headlines as true or fake.
The bot evaluates credibility offline, based entirely on historical data, without accessing the internet.
This leads to an interesting effect:
For example, a headline like
"The heir was killed, a war may start"
may be classified as true, because historically such events have often led to wars (e.g., the assassination that triggered WWI).
As a result, overly dramatic news may appear “true” to the model.
- Built with Python using:
  - `pandas` for data preprocessing
  - `scikit-learn` for machine learning
- Works through a simple Telegram bot interface
- Trained on a small dataset with only two labels (see the training sketch after this list):
  - ✅ `True` — reliable news
  - ❌ `False` — fake news
- No large-scale Russian dataset with multi-class labels (e.g., propaganda, manipulation, clickbait) is currently available.
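For context, here is a minimal sketch of what the training step could look like with `pandas` and `scikit-learn`. The CSV file name, column names, and the TF-IDF + logistic regression baseline are assumptions for illustration, not necessarily the project's exact setup.

```python
import joblib
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

# Hypothetical dataset layout: a "text" column with the headline
# and a binary "label" column (True / False).
df = pd.read_csv("headlines.csv")

X_train, X_test, y_train, y_test = train_test_split(
    df["text"], df["label"], test_size=0.2, random_state=42
)

# TF-IDF features + logistic regression as a simple baseline;
# the actual project may use a different vectorizer or classifier.
model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

print("accuracy:", accuracy_score(y_test, model.predict(X_test)))
joblib.dump(model, "model.joblib")  # saved so the bot can load it later
```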
| Technology | Purpose |
|---|---|
| Python | Core programming language |
| pandas | Data processing |
| scikit-learn | ML model training and evaluation |
| python-telegram-bot | Telegram Bot API integration |
- A user sends a news headline to the bot.
- The bot:
  - Preprocesses and tokenizes the text
  - Converts it into numerical features
- The trained model predicts `True` or `False` (see the bot sketch below).
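A rough sketch of the bot side is shown below, using the async API of `python-telegram-bot` (v20+) and assuming the trained pipeline was saved with `joblib` as in the training sketch above. The token placeholder and file name are illustrative.

```python
import joblib
from telegram import Update
from telegram.ext import ApplicationBuilder, ContextTypes, MessageHandler, filters

# Load the pipeline saved during training (path is an assumption).
model = joblib.load("model.joblib")

async def classify(update: Update, context: ContextTypes.DEFAULT_TYPE) -> None:
    """Classify an incoming headline and reply with the predicted label."""
    headline = update.message.text
    label = model.predict([headline])[0]
    # Labels come straight from the training data (True / False).
    await update.message.reply_text("✅ True" if str(label) == "True" else "❌ False")

def main() -> None:
    app = ApplicationBuilder().token("YOUR_BOT_TOKEN").build()
    # Route every plain text message (commands excluded) to the classifier.
    app.add_handler(MessageHandler(filters.TEXT & ~filters.COMMAND, classify))
    app.run_polling()

if __name__ == "__main__":
    main()
```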
- Add additional labels (propaganda, manipulation, clickbait, etc.)
- Use larger and more diverse datasets
- Experiment with modern NLP models (e.g., transformers) — a minimal example is shown after this list
- Explore fact-checking with external APIs
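As one possible way to explore the transformer direction, a multilingual zero-shot classifier from the Hugging Face `transformers` library could be tried without any task-specific training. The model name and candidate labels below are illustrative choices, not part of the current project.

```python
from transformers import pipeline

# Multilingual zero-shot classification; the model name is one public
# option and could be swapped for any NLI-capable multilingual model.
classifier = pipeline(
    "zero-shot-classification",
    model="joeddav/xlm-roberta-large-xnli",
)

result = classifier(
    "The heir was killed, a war may start",
    candidate_labels=["reliable news", "fake news"],
)
print(result["labels"][0], round(result["scores"][0], 3))
```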
This project is primarily a showcase. Feel free to leave a star or open a pull request with suggestions!
Open an issue or reach out if you'd like to discuss improvements.
The idea was to create a bot that determines whether news is fake or not. The big problem is the correctness of its judgments: the algorithm has no access to the internet, so it judges a headline purely from historical data, and that is the whole catch. A news item may be fake but overly "harsh", for example: "the heir was killed, a war may start". If you reason like the algorithm, the probability of the event "war" given the event "the heir was killed" is very high (this is how the First World War began).

A lot also depends on the quality of the historical data. In my case the dataset is rather small and has only two classes (true and false). If it also included classes such as propaganda or manipulation, the algorithm would handle them quite well, since it can pick up such psychological manipulation in the text. But alas, there is no such high-quality dataset in Russian yet (at least I have not found one).