Announcing the launch of Refuel LLM, a large language model purpose-built for data labeling and enrichment tasks.
In this post, we examine different techniques for estimating the confidence of LLM-generated labels, and demonstrate how to leverage these estimates to automatically reject low-confidence labels and ensemble LLMs optimally.
In this report, we compare the latest models from OpenAI against their previous versions on a data labeling benchmark, and find that gpt-3.5-turbo performs worse on 6 of 8 datasets, while gpt-4 performance remains the same.
In this report, we show that LLMs can label datasets 20x faster and 7x cheaper than skilled human annotators, at the same or better quality.