HomeTechUK AI startup Cosine raises $2.5M for its AI developer outperforming human...

UK AI startup Cosine raises $2.5M for its AI developer outperforming human coders

Date:

Related stories

Starmer must agree youth mobility pact with EU, says business group

Unlock the Editor’s Digest for freeRoula Khalaf, Editor of...

UK exporters still struggling with post-Brexit rules, says trade body

Many exporters are still struggling with post-Brexit trading rules,...

Labour accused of plunging UK into recession amid fury over policies

Labour is plunging the UK into a recession after...

Exeter Science boss appointed to head UK Science Park Association – South West

Sally Basker, chief executive of Exeter Science Park, has...
spot_imgspot_img

UK-founded AI startup Cosine has raised $2.5 million from SOMA and Uphonest in a round that also included funds like Lakestar and Focal. In parallel, it achieved a breakthrough in AI-assisted software development with its artificial developer, Genie.

Based between San Francisco and London, Cosine’s Artificial Developer, Genie, works like a very good human developer. 

For instance, it is able to solve bugs, build features, refactor code, and everything in between either fully autonomously or collaboratively with other developers. 

Founded in 2022, Y-Combinator-backed Cosine’s software was created out of the founder’s realisation of the potential in using LLMs to perform complex tasks in the coding space by imitating human software developers’ behaviours. It is uncannily ‘human’ in its approach to reasoning as a result, with the founders’ primary goal to create truly resilient AI capable of tackling open-ended problems across various domains

The company has achieved a 30 per cent score on SWE-Bench, the industry standard for evaluating software engineering skills in AI models. 

The benchmark, which includes real-world human tasks in software architecture, debugging and implementing new features in existing codebases, assesses an AI model’s ability to understand, modify, and generate complex code. This marks the highest score achieved by any company to date 

Cosine’s score represents a 56 per cent improvement over the previous best score, held by Factory at 19 per cent and 2196 per cent improvement over OpenAI’s GPT4 score of 1.31 per cent. 

By fine-tuning models to emulate human reasoning, Cosine’s approach has beaten out rivals like AWS’s Amazon Q Developer and Cognition’s Devin, both of which scored under 20 per cent  on the same benchmark.

Cosine CEO, Alistair Pullen, shared: 

“Our breakthrough in codifying human reasoning is allowing us to train AI models to operate far beyond the narrow range of tasks and tightly restricted prompts currently available to teams developing software.” 

“We’ve developed a product capable of beating OpenAI and others in completing complex software tasks – in a fraction of the time and money it has taken our competitors to achieve the same results. We’re on course to radically transform the way development and developers work”, continued COO Yang Li. 

“Cosine is not just improving AI; they’re fundamentally teaching AI to reason, providing companies with a true AI colleague”, said Ellen Ma, Partner at Uphonest Capital

- Never miss a story with notifications

- Gain full access to our premium content

- Browse free from up to 5 devices at once

Latest stories

spot_img