PinnedChris MauckinTowards AIHandling Mislabeled Tabular Data to Improve Your XGBoost ModelReduce prediction errors by 70% using data-centric techniques.7 min read·Jan 15, 2023--1--1
Chris MauckinTowards Data ScienceBeware of Unreliable Data in Model Evaluation: A LLM Prompt Selection case study with Flan-T5You may choose suboptimal prompts for your LLM (or make other suboptimal choices via model evaluation) unless you clean your test data10 min read·Jun 16, 2023--2--2
Chris MauckinTowards Data ScienceEffectively Annotate Text Data for Transformers via Active Learning + Re-labelingBoost Transformer model performance with Active Learning assisted data labeling9 min read·Apr 27, 2023----