Newwz.Space

check out different kinds of news here.

Identifying the Source of Generation for Large Language Models

https://arxiv.org/abs/2407.12846...

Identifying the Source of Generation for Large Language Models

Large language models (LLMs) memorize text from several sources of documents. In pretraining, LLM trains to maximize the likelihood of text but neither receives the source of the text nor memorizes the source. Accordingly, LLM can not provide document information on the generated content, and users do not obtain any hint of reliability, which is crucial for factuality or privacy infringement. This work introduces token-level source identification in the decoding step, which maps the token representation to the reference document. We propose a bi-gram source identifier, a multi-layer perceptron with two successive token representations as input for better generalization. We conduct extensive experiments on Wikipedia and PG19 datasets with several LLMs, layer locations, and identifier sizes. The overall results show a possibility of token-level source identifiers for tracing the document, a crucial problem for the safe use of LLMs.

First gene-editing therapy may cure blood disorder

Thursday 08 Aug 2024

resource

Klarna's AI chatbot: how revolutionary is it, really?

Thursday 08 Aug 2024

resource

Cloud storage lockers from Microsoft and Google spread state-sponsored malware

Thursday 08 Aug 2024

resource

Association between prenatal exposure to plastics and autism in boys

Wednesday 07 Aug 2024

resource

Backblaze Drive Stats for Q2 2024

AI Self-Recognition Creates Chances for New Security Risks

Tuesday 06 Aug 2024

resource

Choose your own adventure in 100 lines with micro agents

Monday 05 Aug 2024

resource

The State of the Postgres Community

Monday 05 Aug 2024

resource

Why AI can't fix your production issues

Monday 05 Aug 2024

resource

Implosion of the Yen Carry Trade

Monday 05 Aug 2024

resource

The Case Against Threads

Monday 05 Aug 2024

resource

The SOC2 Starting Seven – Latacora

Monday 05 Aug 2024

resource

Apple now sending up to $395 payments to butterfly keyboard MacBook owners

Sunday 04 Aug 2024

resource

UK's elite hardware talent is being wasted

Sunday 04 Aug 2024

resource

Is eating in front of the TV that bad for you?

Sunday 04 Aug 2024

resource

A Knownbits Abstract Domain for the Toy Optimizer, Correctly

Sunday 04 Aug 2024

resource

Making Debuginfod Viable for the Linux Kernel

Sunday 04 Aug 2024

resource

How to Stay Sane in a World Gone Mad

Saturday 03 Aug 2024

resource

RFC 865 – Quote of the Day Protocol

Saturday 03 Aug 2024

resource

Flowiz – An AI agentic marketplace for document processing

Saturday 03 Aug 2024

resource

Second malaria vaccine launched in Ivory Coast marks new milestone

Saturday 03 Aug 2024

resource

TensorDict: A GPU-accelerated Python dictionary

Friday 02 Aug 2024

resource

An Age of Hyperabundance

Friday 02 Aug 2024

resource

What Happens in a Mind That Can't 'See' Mental Images

Friday 02 Aug 2024

resource

I made an animepahe anime downloader

Friday 02 Aug 2024

resource

Eddiechu/File-Smuggling: HTML smuggling is not an evil, it can be useful

Friday 02 Aug 2024

resource

We haven't seen how bad extreme weather could get

Friday 02 Aug 2024

resource

Kubernetes 1.31 – What's New?

Friday 02 Aug 2024

resource

Working with Terraform Can Be Much Faster

Friday 02 Aug 2024

resource

New Chrome AI features for even more helpful browsing

Thursday 01 Aug 2024

resource

Malaysia is working on an internet 'kill switch', says minister

Thursday 01 Aug 2024

resource

LarryGPT – State of LLM Fine-Tuning in August 2024

Thursday 01 Aug 2024

resource

Clarity needed for complex video-codec patent landscape to thrive

Thursday 01 Aug 2024

resource

AWS Without Access Keys

Thursday 01 Aug 2024

resource

Framework is looking for Linux Community Ambassadors

Thursday 01 Aug 2024

resource

We teach people about George Dantzig

Thursday 01 Aug 2024

resource

NASA's First-Ever Quantum Memory

Wednesday 31 Jul 2024

resource

Boeing Names Kelly Ortberg as Its Chief Executive

Wednesday 31 Jul 2024

resource

What Happened at Baiae, Stayed at Baiae

Wednesday 31 Jul 2024

resource