← heapsort-ai

Tokenization

11 items

RESEARCH↑ trendingReddit r/MachineLearning·19d ago

Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]

This discussion questions whether production Vision-Language Models (VLMs) still rely on fixed-patch Vision Transformers (ViTs) for their vision capabilities, despite the existence of more efficient tokenization methods. It explores potential reasons for this, such as marginal gains, pipeline limitations, or unclear scaling laws for adaptive patching.

42
RESEARCHarXiv CS.LG·11d ago

Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models

This paper introduces COM (Continuity and Ordinality Matter), a strategy that integrates geometric constraints into both the initialization and training stages of token-based time series large language models (TS-LLMs). The research demonstrates that preserving continuity and ordinality in time series token embeddings significantly improves model performance and generalizability.

27
RESEARCHarXiv CS.AI·13d ago

BrickAnything: Geometry-Conditioned Buildable Brick Generation with Structure-Aware Tokenization

This work introduces BrickAnything, a geometry-conditioned autoregressive framework for generating physically buildable brick structures from diverse 3D shapes. It uses point clouds as a unified geometric interface and predicts brick sequences that reconstruct the target shape under assembly constraints, introducing structure-aware tree tokenization.

27