Log4OM V2 - Search News

Deformable ConvNets V2: More Deformable, Better Results

Abstract: The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects. Through an examination of its adaptive behavior, we ...

GitHub

OmniParser: Screen Parsing tool for Pure Vision Based GUI Agent

OmniParser is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that ...

GitHub

RedPajama-Data-v2: an Open Dataset with 30 Trillion Tokens for Training Large Language Models

RedPajama-V2 is an open dataset for training large language models. The dataset includes over 100B text documents coming from 84 CommonCrawl snapshots and processed using the CCNet pipeline. Out of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Deformable ConvNets V2: More Deformable, Better Results

OmniParser: Screen Parsing tool for Pure Vision Based GUI Agent

RedPajama-Data-v2: an Open Dataset with 30 Trillion Tokens for Training Large Language Models

Trending now