Active Learning Text Labeling Pipeline -- 2 need Software Development

Contact person: Active Learning Text Labeling Pipeline -- 2

Location: 6 of October, Egypt

Budget: Recommended by industry experts

Time to start: As soon as possible

Project description:
"I have thousands of raw text records coming in every week and I want a hands-off system that can label them continuously while getting smarter on its own. The pipeline must be fully automated: new text is ingested, pre-processed, routed through a Query-by-Committee active-learning loop, and the agreed-upon label is written back to a database or flat file.

Here is the core flow I have in mind: an ensemble of three to five lightweight text-classification models (they can be variations of fine-tuned transformers, fastText, or scikit-learn models) forms the committee. When the committee disagrees beyond a configurable threshold, the sample is earmarked for retraining—no human in the loop for now, just self-training on the growing pseudo-labeled set. I also need a small service or scheduled job that periodically re-trains the individual models on the newly labeled corpus and updates the ensemble’s voting weights.
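
The committee step described above can be sketched as follows. This is a minimal illustration, not a prescribed design: it assumes disagreement is measured by normalized vote entropy (one common Query-by-Committee measure; the brief does not name one), and the names `vote_entropy`, `weighted_vote`, and `route_sample`, as well as the threshold value used in the example, are hypothetical.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, Sequence, Union


def vote_entropy(votes: Sequence[str], n_classes: int) -> float:
    """Normalized vote entropy: 0.0 when the committee is unanimous,
    1.0 when votes are split evenly over all n_classes labels."""
    if n_classes < 2:
        raise ValueError("n_classes must be at least 2")
    if not votes:
        raise ValueError("votes must not be empty")
    total = len(votes)
    counts = Counter(votes)
    entropy = sum(-(c / total) * math.log(c / total) for c in counts.values())
    return entropy / math.log(n_classes)


def weighted_vote(votes: Sequence[str], weights: Sequence[float]) -> str:
    """Return the label with the largest summed voting weight."""
    if len(votes) != len(weights):
        raise ValueError("votes and weights must have the same length")
    totals: Dict[str, float] = defaultdict(float)
    for label, weight in zip(votes, weights):
        totals[label] += weight
    return max(totals, key=totals.get)


def route_sample(
    votes: Sequence[str],
    weights: Sequence[float],
    n_classes: int,
    threshold: float,
) -> Dict[str, Union[str, float, bool]]:
    """Pick the agreed-upon label and flag the sample for retraining
    when disagreement exceeds the configurable threshold (assumed semantics)."""
    disagreement = vote_entropy(votes, n_classes)
    return {
        "label": weighted_vote(votes, weights),
        "disagreement": disagreement,
        "earmarked_for_retraining": disagreement > threshold,
    }


# Example: three committee members, two classes, equal voting weights.
result = route_sample(["spam", "spam", "ham"], [1.0, 1.0, 1.0], n_classes=2, threshold=0.5)
```

In this sketch the voting weights are plain floats, so a scheduled retraining job could overwrite them (for example with each model's validation score) without touching the routing logic.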

Discrete deliverables
• Python source code (cleanly modular, ideally with Poetry or pip-tools)
• Dockerfile or docker-compose for one-command spin-up
• Configuration guide: how to adjust disagreement thresholds, add/remove models, and switch storage back-ends (PostgreSQL or Parquet)
• README explaining end-to-end workflow and a quick demo on a sample dataset
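
The knobs named in the configuration-guide deliverable could be collected in one validated settings object. The sketch below is only an assumption about how that might look: the class name `PipelineConfig`, its field names, the default values, and the model identifiers are all hypothetical; only the three-to-five committee size and the PostgreSQL/Parquet choice come from the brief.

```python
from dataclasses import dataclass, field
from typing import List

# Storage back-ends named in the brief.
ALLOWED_BACKENDS = ("postgresql", "parquet")


@dataclass
class PipelineConfig:
    # Disagreement level (0.0 to 1.0) above which a sample is earmarked for retraining.
    disagreement_threshold: float = 0.5
    # Hypothetical identifiers for the committee members.
    models: List[str] = field(
        default_factory=lambda: ["tfidf_logreg", "fasttext", "distilbert"]
    )
    # Where agreed-upon labels are written.
    storage_backend: str = "parquet"

    def __post_init__(self) -> None:
        if not 0.0 <= self.disagreement_threshold <= 1.0:
            raise ValueError("disagreement_threshold must be between 0.0 and 1.0")
        if not 3 <= len(self.models) <= 5:
            raise ValueError("the committee must contain three to five models")
        if self.storage_backend not in ALLOWED_BACKENDS:
            raise ValueError(f"storage_backend must be one of {ALLOWED_BACKENDS}")


# Example: switch the output store and tighten the threshold.
config = PipelineConfig(disagreement_threshold=0.3, storage_backend="postgresql")
```

Validating in `__post_init__` means a bad threshold or an unsupported back-end fails at start-up rather than midway through a labeling run.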

Acceptance criteria
• On a provided benchmark set the pipeline achieves at least the same macro-F1 as a single baseline BERT model within three active-learning iterations
• New data dropped into a watch folder is labeled automatically within one minute and the result appears in the output store
• All steps reproducible on a vanilla Ubuntu 22.04 instance with Docker installed only
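
The macro-F1 criterion above can be checked with a few lines of plain Python. This is a sketch under assumptions: macro-F1 is taken as the unweighted mean of per-class F1 over every label seen in the true or predicted labels, and the helper names `macro_f1` and `meets_acceptance` are hypothetical.

```python
from typing import Sequence


def macro_f1(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Unweighted mean of per-class F1 over all labels seen in y_true or y_pred."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    labels = set(y_true) | set(y_pred)
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def meets_acceptance(
    iteration_scores: Sequence[float],
    baseline_score: float,
    max_iterations: int = 3,
) -> bool:
    """True if any of the first max_iterations scores reaches the baseline."""
    return any(score >= baseline_score for score in iteration_scores[:max_iterations])


# Example with made-up labels: per-class F1 is 2/3 for "a" and 4/5 for "b".
score = macro_f1(["a", "a", "b", "b"], ["a", "b", "b", "b"])
```

`meets_acceptance` looks only at the first three iterations, so a pipeline that catches up with the baseline on a later iteration would still fail the check.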

If you have prior experience wiring active-learning loops for text — especially using Query-by-Committee — let me know what tools you prefer and drop a short outline of how you would structure the ensemble." (client-provided description)


Matched companies (7)

Codetreasure Co

🚀 Your Expert Partner for Mobile & Web App Development. Unlock the full potential of your business with Codetreasure, a leading provider of tailored …

JanakiBhuvi Tech Labs Private Limited

Delivering Future-Ready Digital Solutions in Web Development, E-commerce, Logo Design, and Digital Marketing. We believe innovation is key to navigat…

Chirag Solutions

Chirag Solutions is extending its services in website designing & development and software development. Our web and software development is committed…

WhizzAct Private Limited

WhizzAct aims to deliver the supreme service at an effective cost, ensuring complete customer satisfaction. Emphatic use of the latest tools and tech…

eShop Genius

With 12+ years of experience in the industry, we have created more than 1,200 stores and built brands! At eShop Genius, we are an ISO certi…

TG Coders

We create custom apps for businesses and startups. TG Coders is a technology partner specializing in creating custom mobile and web applications for …

Appeonix Creative Lab

At Appeonix Creative Lab, we are more than just an IT company: we are your growth partners. With a passion for innovation and excellence, we craft cus…