Combating Misinformation with Data Science and AI

Challenge background

With 91% of Nepali Population having access to the internet and 65 % users on mobile internet users, digital media and social networks are key for spread of information. The ability to follow a group, share information, react, comment and re-share any post have made a common person a social media star in a matter of days. Also the same media can damage someone's reputation, build distrust towards media, politics, established institutions and governments very quickly. In fact, Fake news spreads like wildfire faster and farther than true stories, and humans are primarily responsible for the spread of misleading information.

More so, during local and general elections in Nepal, we came across a flood of potentially false claims in the media, many of which we assume to be true. There are many cases we can see where political parties misuse social media platforms during elections to advance their populist agendas.

Not everything that one sees on the internet can be believed, yet when we are browsing, we generally don’t seek the source of the information.

There are different Types of Misinformation:

ClickBait.
Propaganda.
Sponsored contents.
Satire and hoax.
Misinformation.
False news.
Disinformation.
Rumours.
DeepFakes and Manipulated photos and video.
Posts designed to type RIP and comment to get viral.

Different types of problems require different types of solutions.

The problem

Though there are many fact checking organisations (eg: NepalCheck.Org , Nepal Factcheck, South Asia Check, Media Action Nepal) many fake news still spread in the social media. The goal of the project is to provide a platform to allow individuals to check if a news content/ post is already fact checked and identify it as fact or fake.

Goal of the project

Source through different Nepali fact check organisation to collect a dataset of verified fake news.
Carryout data processing and data analytics to understand the distribution of misinformation.
Apply Data Science, Data Engineering and Machine Learning develop an API to identify if a particular post has already been classified as misinformation.
Develop a front-end and host a platform to check if a post has already been classified as misinformation.

Project timeline

1
Week 1
Data Collection
2
Week 2
Data Pre-Prcessing
3
Week 3
Exploratory Data Analysis
4
Week 4
Modelling
5
Week 5
Testing Model
6
Week 6
Building API
7
Week 7
Building Front-end
8
Week 8
Deployment

What you'll learn

Data Collection: Source and scrape news and post on fake information.
Data Cleaning.
Data Analysis.
Building Machine Learning Models.
Developing an API.
Building and hosting the platform.

Challenge background

The problem

Goal of the project

Project timeline

Week 1

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Week 8

What you'll learn

What to expect from a Local Chapter project

First project

Benefits

Requirements

This challenge is hosted by

Nepal Chapter

Leveraging AI to Combat Climate Change in Bhutan

Building EduFundAI – (Education + Funding + AI)

Building Agentic based Mental Health chatbot using Langchain workflows

Combating Misinformation with Data Science and AI

Challenge background

The problem

Goal of the project

Project timeline

Week 1

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Week 8

What you'll learn

What to expect from a Local Chapter project

First project

Benefits

Requirements

This challenge is hosted by

Nepal Chapter

Other Local Chapter projects

Leveraging AI to Combat Climate Change in Bhutan

Building EduFundAI – (Education + Funding + AI)

Building Agentic based Mental Health chatbot using Langchain workflows