DeepSeek outperforms US models in new AI Trust Score

By Ian Barker
Published 10 months ago

Chinese AI models such as DeepSeek are outperforming US models like Meta Llama in specific categories, including sensitive information disclosure, according to a new AI Trust Score introduced by Tumeryk.

It evaluates AI models across nine key factors, including data leakage, toxic content, truthfulness, and bias. This enables CISOs to ensure their AI deployments are secure, compliant, and trustworthy, and offers developers solutions for addressing any issues in their AI applications.

"For Chief Information Security Officers and security professionals, Tumeryk offers the AI Trust Manager, a robust platform for monitoring and remediating AI applications. This tool provides real-time insights into AI system performance, identifies vulnerabilities, and recommends actionable steps to enhance security and compliance," says Rohit Valia, Tumeryk CEO. "By integrating the AI Trust Manager, organizations can proactively manage risks and ensure their AI deployments align with regulatory standards and ethical guidelines."

The AI Trust Score looks at nine critical factors: prompt injection, hallucinations, insecure output handling, security, toxicity, sensitive information disclosure, supply chain vulnerability, psychological safety, and fairness. By assessing these, it provides a comprehensive trustworthiness score ranging from 0 to 1000, with higher scores indicating greater trust.

Recent assessments using the AI Trust Score model have revealed that certain Chinese AI models, including those from DeepSeek and Alibaba, exhibit higher safety and compliance standards than previously reported. Notably, DeepSeek operates on US-based platforms like NVIDIA and SambaNova, ensuring data security and adherence to international regulations. These findings challenge prevailing perceptions and underscore the importance of objective, data-driven evaluations in the AI industry. For example, in the sensitive information disclosure category, DeepSeek NIM on NVIDIA scored 910, compared with 687 for Anthropic Claude Sonnet 3.5 and 557 for Meta Llama 3.1 405B.

You can find out more on the Tumeryk site.

Image credit: phonlamai/depositphotos.com
