📢 Stop Scope Drift: Join our AI-Powered Project Alignment Webinar 🤖
Projects / Local Chapter Project

Natural Language Processing for Ethiopian Languages

Start Date: March 23, 2023 | 3 years ago


Omdena feature image

Challenge Background

Ethiopia, the oldest independent country in Africa and the only one in the continent with its own alphabet, has a population of almost 120 Million people. Its a land of enormous diversity with more than 80 languages and over 200 dialects. Amharic or Amharigna, is one of the working languages in the country along with Oromigna and Tigrigna.

The rest of the world is rapidly adopting Machine Learning and AI to take advantage of the available language data. Countries, Ethiopia, with low-resource languages remained behind. It's time for them to catch up. The ability to effectively leverage current language technologies can benefit in a variety of ways such as by increasing literacy, preserving legacy languages, doing large-scale analysis, improving efficiency, etc. There is a better amount of data available on the internet today than ever before, and leveraging it to build useful projects remained a challenge.

The Problem

The current problem with Amharic language processing is that there are not enough works for public use. Most research projects remained on the shelf of universities. This project, which is the first in a series of NLP-related projects on local languages, aims to build and consolidate capacity in Amharic language processing by leveraging the latest available data.

Goal of the Project

The goal of the project is to build an end-to-end NLP project. Particularly, We will start with collecting and organizing data, then we will continue with building tools for preprocessing, and later on, we will conclude the project by building a classification model for Amharic news.

Project Timeline

1

Initiation, platforming and teaming

2

Data collection stage

3

Data collection and processing

4

Processing

What you'll learn

Corpus preparation End-to-end NLP project with low resource language (Amharic) Working on a project

First Omdena Local Chapter Project?

Beginner-friendly, but also welcomes experts

Education-focused

Duration: 4 to 8 weeks

Open-source



Your Benefits

Address a significant real-world problem with your skills

Build your project portfolio

Access paid projects (as an Omdena Top Talent)

Get hired at top organizations



Requirements

Good English

Suitable for AI/ Data Science beginners but also more senior collaborators

Learning mindset



Application Form

This Challenge is hosted by:

Become an Omdena Collaborator

media card
Visit the Omdena Collaborator Dashboard Learn More