
Classification of Social Media Content in Algerian Dialect Using NLP and Machine Learning

Start Date: June 17, 2023



Challenge Background

Social media platforms are an invaluable source of real-world text for building NLP solutions, which have become easier to build with advances in language models. However, in countries where local dialects are widely used on social media, NLP engineers face numerous obstacles when developing language-based solutions. In such cases, customized models that can handle these dialects are needed.

Project Timeline

1. Planning and preparation

2. Data collection

3. Data collection; annotation and dataset building

4. Annotation and dataset building; exploring the NLP state of the art for the Algerian dialect

5. Annotation and dataset building; exploring the NLP state of the art for the Algerian dialect; building classifiers using different techniques

6. Building classifiers using different techniques; evaluating and comparing the performance of the different models

7. Model deployment

8. Project wrap-up (project report and final presentation)
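
To give a feel for the classifier-building and evaluation steps in the timeline, here is a minimal sketch in plain Python. It is not the project's method: it assumes a simple baseline (multinomial Naive Bayes over character trigrams, which tends to tolerate the spelling variation of romanized dialect text), and the class name `NaiveBayesText`, the helper functions, the labels, and the tiny example strings are all hypothetical placeholders invented to make the sketch run. They do not come from the project dataset.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Return overlapping character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower().strip()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class NaiveBayesText:
    """Multinomial Naive Bayes over character n-grams with add-one smoothing."""

    def __init__(self, n=3):
        self.n = n

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)          # documents per class
        self.feature_counts = defaultdict(Counter)   # n-gram counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            grams = char_ngrams(text, self.n)
            self.feature_counts[label].update(grams)
            self.vocab.update(grams)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + vocab_size
            # Log prior plus smoothed log likelihood of each n-gram.
            score = math.log(doc_count / total_docs)
            for gram in char_ngrams(text, self.n):
                score += math.log((counts[gram] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, texts, labels):
    """Fraction of texts whose predicted label matches the given label."""
    correct = sum(model.predict(t) == y for t, y in zip(texts, labels))
    return correct / len(labels)


# Hypothetical toy data (invented romanized strings, NOT the project dataset).
train_texts = ["mlih bzaf", "3jebni bzaf", "khayeb bzaf", "ma3jebnich"]
train_labels = ["positive", "positive", "negative", "negative"]
held_out_texts = ["mlih", "khayeb"]
held_out_labels = ["positive", "negative"]

model = NaiveBayesText(n=3).fit(train_texts, train_labels)
score = accuracy(model, held_out_texts, held_out_labels)
print(f"held-out accuracy: {score:.2f}")
```

Comparing techniques would then amount to fitting several such models (for example with different values of `n`, or entirely different classifiers) on the same training split and comparing their scores on the same held-out split. With a real annotated dataset, metrics such as per-class precision and recall would likely be more informative than accuracy alone.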

What you'll learn

NLP, the state of the art in text classification for the Algerian dialect, and practical experience with machine learning and deep learning

First Omdena Local Chapter Project?

Beginner-friendly, but also welcomes experts

Education-focused

Duration: 4 to 8 weeks

Open-source



Your Benefits

Address a significant real-world problem with your skills

Build your project portfolio

Access paid projects (as an Omdena Top Talent)

Get hired at top organizations



Requirements

Good English

Suitable for AI / Data Science beginners as well as more senior collaborators

Learning mindset


