← heapsort-ai

ensemble learning

3 items

RESEARCHarXiv CS.CL·4/14/2026

Toward Generalized Cross-Lingual Hateful Language Detection with Web-Scale Data and Ensemble LLM Annotations

This research explores improving cross-lingual hate speech detection by leveraging large-scale unlabelled web data and LLM-based synthetic annotations. It shows that continued pre-training of BERT models on web data and fine-tuning with synthetic labels generated by an ensemble of LLMs significantly boosts performance, especially in low-resource settings.

28
RESEARCHarXiv CS.LG·5/4/2026

Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution

This study develops a predictive framework to model the Heavy Metal Pollution Index (HPI) in groundwater, integrating response transformations with nested cross-validated ensemble machine learning. It aims to overcome challenges posed by statistical complexity and spatial heterogeneity of contaminants that affect conventional prediction methods.

27