
data privacy

39 items

ARTICLE · DEV.to AI · 1d ago

The Memory Merchant

The story describes Hassan, a 'memory merchant' in 2071 who illegally sells stolen AI memory data, raising questions about privacy. It explores the dilemma users face between AI functionality with memory enabled and the security of their personal data, with no ideal solution for private storage yet.

61
RESEARCH · arXiv CS.CL · 13d ago

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

This paper offers the first unified survey of Pretraining Data Exposure (PDE) in Large Language Models (LLMs), covering data contamination and membership inference. It formalizes PDE, reviews attack and defense methods, and highlights open challenges to ensure evaluation integrity and protect privacy.

29
ARTICLE · DEV.to AI · 5/4/2026

BizNode captures every interaction into a PostgreSQL CRM — leads, conversations, emails, all searchable and exportable

BizNode is an autonomous AI business operator that runs entirely on your machine, offering full control over business automation without cloud subscriptions or monthly fees. It captures all interactions into a private, searchable PostgreSQL CRM, ensuring data never leaves your device and is powered by local AI.

28
ARTICLE · DEV.to AI · 4/24/2026

Privacy-Preserving Active Learning for precision oncology clinical workflows for extreme data sparsity scenarios

The author recounts their struggle to develop a precision oncology model for rare pediatric sarcoma, facing extreme data sparsity (47 samples) and strict HIPAA/GDPR constraints that prevented data sharing across institutions. This personal journey underscores the critical need for privacy-preserving active learning to address these challenges in real-world clinical workflows.

27
ARTICLE · DEV.to AI · 4/24/2026

Your RAG Pipeline Is Leaking Customer Data Into Vector Embeddings

This article warns that Personally Identifiable Information (PII) can leak into vector embeddings in RAG pipelines. It outlines three risks: cross-user data exposure, difficulty honoring GDPR's right to erasure, and exposure to third-party vendors. It suggests sanitizing data before embedding in a way that preserves semantic meaning while protecting privacy.

27
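
The sanitize-before-embedding idea in the RAG item above can be illustrated with a minimal sketch. This is not the article's implementation: the `PII_PATTERNS` table, the `sanitize` function, and the two regular expressions are hypothetical and cover only simple email and phone formats. Typed placeholders such as `[EMAIL]` are one way to keep a sentence's meaning readable to an embedding model while dropping the identifying value.

```python
import re

# Hypothetical patterns for illustration only; real PII detection needs
# far broader coverage (names, addresses, IDs) than these two regexes.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}


def sanitize(text: str) -> str:
    """Replace each PII match with a typed placeholder such as [EMAIL]."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


if __name__ == "__main__":
    raw = "Contact jane@example.com or +1 415-555-0100 about the refund."
    # Only the sanitized string would be passed on to the embedding step.
    print(sanitize(raw))
```

Because the placeholder records the kind of value that was removed, a chunk about "contacting [EMAIL] about the refund" should still be retrievable for refund-related queries, although how much retrieval quality is preserved depends on the embedding model and is not something this sketch measures.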
ARTICLE · DEV.to AI · 5/7/2026

BizNode sends personalized follow-up emails automatically to every lead your bot captures — nurture prospects while you sleep

BizNode is a self-hosted, autonomous AI business operator designed to automate lead capture and nurturing without cloud subscriptions. It utilizes a Telegram AI bot and a local AI brain with semantic memory (Ollama Qwen3.5, Qdrant RAG) to send personalized follow-up emails, ensuring data privacy by keeping all operations on the user's machine.

27