Natural Language Processing for Ethiopian Languages
Challenge Background
Ethiopia, the oldest independent country in Africa and the only one in the continent with its own alphabet, has a population of almost 120 Million people. Its a land of enormous diversity with more than 80 languages and over 200 dialects. Amharic or Amharigna, is one of the working languages in the country along with Oromigna and Tigrigna.
The rest of the world is rapidly adopting Machine Learning and AI to take advantage of the available language data. Countries, Ethiopia, with low-resource languages remained behind. It's time for them to catch up. The ability to effectively leverage current language technologies can benefit in a variety of ways such as by increasing literacy, preserving legacy languages, doing large-scale analysis, improving efficiency, etc. There is a better amount of data available on the internet today than ever before, and leveraging it to build useful projects remained a challenge.
The Problem
The current problem with Amharic language processing is that there are not enough works for public use. Most research projects remained on the shelf of universities. This project, which is the first in a series of NLP-related projects on local languages, aims to build and consolidate capacity in Amharic language processing by leveraging the latest available data.
Goal of the Project
The goal of the project is to build an end-to-end NLP project. Particularly, We will start with collecting and organizing data, then we will continue with building tools for preprocessing, and later on, we will conclude the project by building a classification model for Amharic news.
Project Timeline
Initiation, platforming and teaming
Data collection stage
Data collection and processing
Processing
What you'll learn
Corpus preparation End-to-end NLP project with low resource language (Amharic) Working on a project
First Omdena Local Chapter Project?
Beginner-friendly, but also welcomes experts
Education-focused
Duration: 4 to 8 weeks
Open-source
Your Benefits
Address a significant real-world problem with your skills
Build your project portfolio
Access paid projects (as an Omdena Top Talent)
Get hired at top organizations
Requirements
Good English
Suitable for AI/ Data Science beginners but also more senior collaborators
Learning mindset
Application Form
This Challenge is hosted by:
Become an Omdena Collaborator

