Feed aggregator

Git Koans

Hacker News - Wed, 04/23/2025 - 5:42pm
Categories: Hacker News

Show HN: Fast and Quality Code Chunking with Chonkie

Hacker News - Wed, 04/23/2025 - 5:41pm

Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

- Embedding-based code search - RAG (retrieval-augmented generation) over codebases - Long-context analysis of code - Preparing repos for fine-tuning or pretraining Try it out:

- Open source package: https://docs.chonkie.ai/chunkers/code-chunker - Hosted playground (free with account): https://cloud.chonkie.ai Happy Chonking!

Comments URL: https://news.ycombinator.com/item?id=43776908

Points: 1

# Comments: 0

Categories: Hacker News

How is π.com a valid domain name?

Hacker News - Wed, 04/23/2025 - 5:39pm

Article URL: https://www.xn--1xa.com/

Comments URL: https://news.ycombinator.com/item?id=43776891

Points: 2

# Comments: 4

Categories: Hacker News

Shopify faces privacy lawsuit for collecting customer data

Malware Bytes Security - Wed, 04/23/2025 - 5:38pm

Shopify faces a data privacy class action lawsuit in the US that could change the way globally active companies can be held accountable.

The proposed class action is a revival of a case that had been dismissed by a lower court judge and a three-judge 9th Circuit Court of Appeals panel. But now it’s been brought back after a decision by the full 9th Circuit.

Shopify is a global commerce platform headquartered in Ottawa, Canada. It provides the infrastructure and tools that businesses of all sizes use for retail operations, both online and offline.

To provide these services, Shopify collects personally identifiable information (PII) from buyers, primarily to facilitate and improve their commerce experience. This data includes names, email addresses, phone numbers, shipping and billing addresses, IP addresses, device information, and behavioral data. That is, all the information needed for processing orders, managing payments, shipping products, and communicating with end customers effectively.

With this collection of PII comes responsibility. Shopify acknowledges the data belongs to the users and is collected only to the extent necessary to provide its services. It claims to implement robust security measures to protect this data from unauthorized access and complies with relevant privacy laws such as GDPR.

But Brandon Briskin, a California resident claims Shopify installed tracking cookies on his iPhone without his consent when he bought athletic wear from a retailer, and used his data to create a profile it could sell to other merchants.

The case was at first dismissed after Shopify argued it should not be sued in California because it operates nationwide and did not aim its conduct toward that state.

The dismissal was revoked because the judges found that:

“Shopify deliberately reached out … by knowingly installing tracking software onto unsuspecting Californians’ phones so that it could later sell the data it obtained, in a manner that was neither random, isolated, or fortuitous.”

A Shopify spokesman told Reuters that the decision makes online retailers vulnerable to lawsuits anywhere and “attacks the basics of how the internet works,” and that it drags entrepreneurs who run online businesses into distant courtrooms regardless of where they operate.

Briskin’s lawyer said the court bolstered accountability for internet-based companies by rejecting the argument that a company is jurisdictionally ‘nowhere’ because it does business ‘everywhere.’

And many US states agreed they need an ability to enforce their own consumer protection laws against companies that avail themselves of local marketplaces through the internet.

The general expectation is that this decision could make it easier for American courts to assert jurisdiction over internet-based platforms. The majority of the 9th Circuit, which includes nine western US states, Guam, and the Northern Mariana Islands, adhered to the “traveling cookie rule” because it “impermissibly manufactures jurisdiction wherever the plaintiff goes.”

We don’t just report on data privacy—we help you remove your personal information

Cybersecurity risks should never spread beyond a headline. With Malwarebytes Personal Data Remover, you can scan to find out which sites are exposing your personal information, and then delete that sensitive data from the internet.

Categories: Malware Bytes

The rise of AI powered threats and non-human identities will push an already strained identity security model over the edge

Computer Weekly Feed - Wed, 04/23/2025 - 5:03pm
The rise of AI powered threats and non-human identities will push an already strained identity security model over the edge
Categories: Computer Weekly

Mandiant's latest annual threat report reveals new data on how financially-motivated cyber criminals, such as ransomware gangs, dominate the cyber security landscape

Computer Weekly Feed - Wed, 04/23/2025 - 5:03pm
Mandiant's latest annual threat report reveals new data on how financially-motivated cyber criminals, such as ransomware gangs, dominate the cyber security landscape
Categories: Computer Weekly

Ask HN: What Have You Replaced with AI?

Hacker News - Wed, 04/23/2025 - 4:55pm

Comments URL: https://news.ycombinator.com/item?id=43776530

Points: 1

# Comments: 0

Categories: Hacker News

Today's NYT Connections: Sports Edition Hints and Answers for April 24, #213

CNET Feed - Wed, 04/23/2025 - 4:54pm
Hints and answers for the NYT Connections: Sports Edition puzzle, No. 213, for April 24.
Categories: CNET

Pages