Combating Misinformation with Data Science and AI
Challenge Background
With 91% of Nepali Population having access to the internet and 65 % users on mobile internet users, digital media and social networks are key for spread of information. The ability to follow a group, share information, react, comment and re-share any post have made a common person a social media star in a matter of days. Also the same media can damage someone's reputation, build distrust towards media, politics, established institutions and governments very quickly. In fact, Fake news spreads like wildfire faster and farther than true stories, and humans are primarily responsible for the spread of misleading information.
More so, during local and general elections in Nepal, we came across a flood of potentially false claims in the media, many of which we assume to be true. There are many cases we can see where political parties misuse social media platforms during elections to advance their populist agendas.
Not everything that one sees on the internet can be believed, yet when we are browsing, we generally don’t seek the source of the information.
There are different Types of Misinformation:
- ClickBait.
- Propaganda.
- Sponsored contents.
- Satire and hoax.
- Misinformation.
- False news.
- Disinformation.
- Rumours.
- DeepFakes and Manipulated photos and video.
- Posts designed to type RIP and comment to get viral.
Different types of problems require different types of solutions.
The Problem
Though there are many fact checking organisations (eg: NepalCheck.Org , Nepal Factcheck, South Asia Check, Media Action Nepal) many fake news still spread in the social media. The goal of the project is to provide a platform to allow individuals to check if a news content/ post is already fact checked and identify it as fact or fake.
Goal of the Project
- Source through different Nepali fact check organisation to collect a dataset of verified fake news.
- Carryout data processing and data analytics to understand the distribution of misinformation.
- Apply Data Science, Data Engineering and Machine Learning develop an API to identify if a particular post has already been classified as misinformation.
- Develop a front-end and host a platform to check if a post has already been classified as misinformation.
Project Timeline
Data Collection
Data Pre-Prcessing
Exploratory Data Analysis
Modelling
Testing Model
Building API
Building Front-end
Deployment
What you'll learn
- Data Collection: Source and scrape news and post on fake information.
- Data Cleaning.
- Data Analysis.
- Building Machine Learning Models.
- Developing an API.
- Building and hosting the platform.
First Omdena Local Chapter Project?
Beginner-friendly, but also welcomes experts
Education-focused
Duration: 4 to 8 weeks
Open-source
Your Benefits
Address a significant real-world problem with your skills
Build your project portfolio
Access paid projects (as an Omdena Top Talent)
Get hired at top organizations
Requirements
Good English
Suitable for AI/ Data Science beginners but also more senior collaborators
Learning mindset
Application Form
This Challenge is hosted by:
Become an Omdena Collaborator

