AI Insight Hub

by AI Insight Hub 2023-10-10 0

Efficiency Meets Quality: Google & JHU Pioneers Conditional Diffusion Distillation in Just 1-4 Sampling Steps

In a new paper Conditional Diffusion Distillation, a research team from Google Research and Johns Hopkins University introduces an innovative framework that distills an unconditional diffusion model into a conditional one, enabling image generation with significantly fewer steps.

by AI Insight Hub 2023-10-09 2

Mind-to-Speech: The New Frontier in Neuro Communication Through Perception From Non-Invasive Brain Signals

In a new paper Decoding speech perception from non-invasive brain recordings, a research team from Meta AI, Inria Saclay and PSL University exhibits the remarkable capability to decode speech from brain signals recorded non-invasively through magneto-encephalography (MEG) or electro-encephalography (EEG).

by AI Insight Hub 2023-10-06 1

General-Purpose Robot RT-X: A Collaboration between DeepMind and 33 Academic Labs

DeepMind, in collaboration with 33 academic laboratories heralds the arrival of RT-1-X, a novel robotics transformer (RT) model that evolves from RT-1. RT-1-X is meticulously trained on the novel Open X-Embodiment dataset constructed by the researchers and showcases a remarkable 50% improvement in success rates compared to task-specified models.

by AI Insight Hub 2023-10-04 4

Microsoft Unveils the Potential of Large Multimodal Models with GPT-4V(ision)

A Microsoft research team conducts an in-depth analysis of the latest model, GPT-4V(ision). Their report delves into the emerging application scenarios and outlines future research directions for GPT-4V-based systems, with the goal of inspiring research on next-generation multimodal task formulation and the development of more robust LLMs.

by AI Insight Hub 2023-10-03 1

NNAISENSE’s New Class of Generative Model: Bayesian Flow Networks Break Barriers in Handing Discrete Data

A NNAISENSE research team introduces a novel class of generative models known as Bayesian Flow Networks (BFNs). These BFNs combine the power of Bayesian inference with neural networks in an iterative modeling process, enabling successful application to continuous, discretized, and discrete data while maintaining competitive performance.

by AI Insight Hub 2023-10-02 2

Standford U’s MAPTree: Redefining Decision Trees – Precision, Speed, and Efficiency Unleashed

In a new paper MAPTree: Beating “Optimal” Decision Trees with Bayesian Decision Trees, a Stanford University research team introduces MAPTree, an algorithm that confidently uncovers the maximum a posteriori tree within Bayesian Classification and Regression Trees (BCART) posterior, achieving strong performance with significantly leaner and faster trees.

by AI Insight Hub 2023-09-29 1

Microsoft’s CodePlan: Unleashing the Power of Language Models for Repository-Level Coding Tasks

In a recent paper, “CodePlan: Repository-level Coding using LLMs and Planning,” a team from Microsoft Research introduces CodePlan—a versatile framework designed to address the complexities of repository-level coding tasks, encompassing extensive code changes across large, interconnected codebases.

by AI Insight Hub 2023-09-27 2

Meta AI’s Long-Context LLMs: Redefining the Landscape of Natural Language Processing

In a new paper Effective Long-Context Scaling of Foundation Models, a Meta AI research team presents a series of long-context LLMs, built through the pretraining from LLAMA 2. These models support effective context windows of up to 32,768 tokens and outperform all existing open-sourced models in terms of performance.

by AI Insight Hub 2023-09-26 1

The Reversal Curse: Uncovering the Intriguing Limits of Language Models

In a new paper titled “The Reversal Curse: LLMs trained on ‘A is B’ fail to learn ‘B is A'” authored by a collaborative research team from Vanderbilt University, the UK Frontier AI Taskforce, Apollo Research, New York University, the University of Sussex, and the University of Oxford, has unveiled a remarkable shortcoming in auto-regressive large language models (LLMs).

by AI Insight Hub 2023-09-25 1

One half-day of training using a few hundred dollars yields similar results to mainstream large models, open-source and commercial-free domain-specific LLM solution

Being at the forefront of cost reduction and efficiency enhancement for large models, the Colossal-AI team maximizes the core capabilities of LLaMA-2. Through innovative training techniques, Colossal-AI has achieved remarkable results by utilizing only approximately 0.0085 trillion tokens of data, investing 15 hours, and incurring training costs in the range of a few hundred dollars.

by AI Insight Hub 2023-09-24 10

Language Models Redefined: Transforming Textual Mastery into Compression Brilliance

In a new paper Language Modeling Is Compression, a collaborative team from Google DeepMind, Meta AI, and Inria delves into the lossless compression capabilities of foundation models, unveiling their achievement of state-of-the-art compression rates across various data types.

by AI Insight Hub 2023-09-20 1

From Stagnant to Stunning: Google Transforms Still Images into Photo-Realistic Animations

In a paper titled “Generative Image Dynamics,” a Google research team introduces an innovative approach to model natural oscillation dynamics using a single static image. This approach yields photo-realistic animations derived from a lone image, surpassing the performance of previous methods by a substantial margin.

by AI Insight Hub 2023-09-20 1

Unveiling the Enigma: Meta AI & UPC Decodes the Inner Workings of Large Scale Language Models

In a new paper Neurons in Large Language Models: Dead, N-gram, Positional, a research team from Meta AI and Universitat Politècnica de Catalunya conducts comprehensive analysis of a family of Open Pre-trained Transformer Language Models (OPT) up to 66b parameters to provide insights of how feed-forward network (FFN) layers act.

by AI Insight Hub 2023-09-18 8

Revolutionizing Autonomous Agents: AGENTS Framework Puts Power in Your Hands

In a new paper Agents: An Open-source Framework for Autonomous Language Agents, a research team from AIWaves Inc., Zhejiang University and ETH Zürich releases AGENTS, an open-source framework that enables non-specialists for developing and deploying state-of-the-art autonomous language agents with minimal coding work.

by AI Insight Hub 2023-09-15 2

DeepMind Decodes the Puzzle of ‘ Grokking ’ In Neural Network Generalization Through Circuit Efficiency

In a new paper Explaining grokking through circuit efficiency, a DeepMind research team solves the puzzle of the grokking through circuit efficiency theory, revealing that the generalizing solution is slower to learn then memorizing.

by AI Insight Hub 2023-09-14 0

Microsoft’s phi-1.5 Challenges LLMs Scaling Law, Showcases the Crucial Rule for ‘Textbook Quality’ Dataset

A Microsoft research team introduce phi-1.5, a 1.3 billion parameter model trained on a vast dataset of 30 billion tokens, remarkably delivering performance that rivals models five times its size. Moreover, it outperforms most non-frontier LLMs in tackling intricate reasoning tasks.

by AI Insight Hub 2023-09-12 1

Unlocking the Power of Visual Modeling: Microsoft’s Sparse MoEs Redefine Efficiency and Excellence

An Apple research team introduces the concept of sparse Mobile Vision MoEs (V-MoEs), which represents a streamlined and mobile-friendly Mixture-of-Experts architecture that efficiently downscales Vision Transformers (ViTs) while preserving impressive model performance.

by AI Insight Hub 2023-09-12 3

Revolutionizing Optimization: DeepMind Leverages Large Language Models as Intelligent Optimizers

In a new paper Large Language Models as Optimizers, a Google DeepMind research team introduces Optimization by PROmpting (OPRO), an effective method that leverages large language models (LLMs) as optimizers, which can generate optimization solutions conditioned on the natural language that describes the optimization task.

by AI Insight Hub 2023-09-08 2

Equall & Apple’s Revolutionizing Transformers: One Wide Feedforward for Unprecedented Efficiency and Accuracy

A collaborative research effort from Equall and Apple delves into the role of the FFN and uncovers a surprising revelation: despite consuming a significant portion of the model’s parameters, the FFN exhibits high redundancy. As a result, the researchers propose sharing a single FFN across both the encoder and decoder, thereby reducing the parameter count while causing only a modest drop in accuracy.

by AI Insight Hub 2023-09-06 2

Unlocking Limitless Retrieval Power: Google’s MEMORY-VQ Revolutionizes LLMs with Remarkable Compression

In a new paper MEMORY-VQ: Compression for Tractable Internet-Scale Memory, a Google research team introduces MEMORY-VQ, a novel method that significantly reduce storage requirements for memory-based methods while maintaining high performance, achieving 16x compression rate on the KILT benchmark.

by AI Insight Hub 2023-09-05 3

MIT’s AskIt Provides A Unified Programming Interface for Code Generation with LLMs

In a new paper AskIt: Unified Programming Interface for Programming with Large Language Models, a MIT CSAIL research team presents AskIt, a domain-specific language (DSL) tailored for LLMs to accommodate a wide variety of tasks, which substantially reducing practitioners’ developmental overhead and effort for software.

by AI Insight Hub 2023-09-04 3

70 billion parameter LLaMA2 model training accelerated by 195% with best foundation model practice upgraded

Colossal-AI provides revolutionary LLaMA2 training efficiency for 8 to 512 GPUs, fine-tuning, and inference solutions. The 70 billion parameter training can be accelerated by 195%, and provides a fully-managed ML cloud platform solution, greatly reducing the cost of large model development and applications.

by AI Insight Hub 2023-08-31 4

Meta AI’s Nougat Enables Conversion of Mathematic Expressions from PDF Files to Machine Readable Texts

A Meta AI research team presents Neural Optical Understanding for Academic Documents (Nougat), a Visual Transformer model that can effectively convert scientific documents stored in PDF format to a lightweight markup language, even intensive mathematical equations are involved.

by AI Insight Hub 2023-08-31 4

CMU & Tsinghua U’s Prompt2Model Generates Deployable Models Following Natural Language Instructions

In a new paper Prompt2Model: Generating Deployable Models from Natural Language Instructions, a research team from Carnegie Mellon University and Tsinghua University introduces Prompt2Model, a general-purpose approach that is able to use prompting technique to specify system behavior while resulting in a deployable special purpose model that enjoys all the advantages thereof.

by AI Insight Hub 2023-08-29 4

Published In Nature: New Breakthrough of Speech-to-Text BCI Achieves Speed of 62 Words/Minute

Speech brain–computer interfaces (BCIs) is a innovate technology that establishes a communication channel between a user and certain devices viaContinue Reading

by AI Insight Hub 2023-08-29 5

Meta AI Open Sources Code Llama: A SOTA Code-Specialized Llama 2

In a new paper Code Llama: Open Foundation Models for Code, a Meta AI research team releases Code Llama, a family of code-specialized Llama 2 models for code generation and infilling, which achieves state-of-the-art performance against open models on code benchmarks.

by AI Insight Hub 2023-08-26 7

Diversifying AI: DeepMind Pushes AI Toward Creative Game Players

In a new paper Diversifying AI: Towards Creative Chess with AlphaZero, a Google DeepMind research team explores whether artificial intelligence can benefit from creative problem-solving mechanisms identified in human intelligence while pushing to the limits of its computational rationality.

by AI Insight Hub 2023-08-25 2

DeepMind & Toulouse U Contribute Composable Function Preserving Transformations to Boost Transformer Training

In a new paper Composable Function-preserving Expansions for Transformer Architectures, a research team from Google DeepMind and University of Toulouse introduces parameter expansion transformations for transformer-based neural networks while preserving functionality, enabling the expansion of the capability of the model as needed.

by AI Insight Hub 2023-08-22 3

Microsoft’s SpeechX: A Leap in Versatile Generative Speech Synthesis

In a new paper SpeechX: Neural Codec Language Model as a Versatile Speech Transformer, a Microsoft research team presents SpeechX, a versatile, robust, and extensible speech generation model that is capable to address zero-shot TTS and various speech transformation tasks, handling both clean and noisy signals.

by AI Insight Hub 2023-08-21 3

Boston U’s Platpus Provides Quick, Cheap, and Powerful Refinement of LLMs, Achieving Top 1 in Open LLM Leaderboard

In a new paper Platypus: Quick, Cheap, and Powerful Refinement of LLMs, a Boston University research team presents Platpus, a family of fine-tuned and merged Large Language Models (LLMs) that achieves the first place in HuggingFace’s Open LLM Leaderboard by performing quick, cheap and powerful refinement of conventional LLMs.

by AI Insight Hub 2023-08-18 5

HPC-AI Tech Raises 22 Million USD in Series A Funding to Fuel Team Expansion and Business Growth

Singapore – HPC-AI Tech, a pioneering company specializing in efficient large AI model training, is delighted to announce the successful completion of its Series A funding round, securing a total of 22 Million USD.

by AI Insight Hub 2023-08-18 2

Alex Graves’s Team Latest Work, Bayesian Flow Networks Address Discrete Data Generation Issues

In a new paper Bayesian Flow Networks, the NNAISENSE research team presents Bayesian Flow Networks (BFNs), a novel family of generative model manipulates parameters of the data distribution rather than operating on noisy data, which provides an effective solution to deal with discrete data.

by AI Insight Hub 2023-08-16 2

MIT & Harvard’s Open-Source FAn System Enables Real-Time Any Objects Detection, Tracking, and Following

In a new paper Follow Anything: Open-set detection, tracking, and following in real-time, a research team from MIT and Harvard University presents the follow anything system (FAn), an open-set real-time any object following framework that can detect, segment, track, and follow any object, and is able to adapt to new objects using text, images, or click queries.

by AI Insight Hub 2023-08-15 5

Meta AI’s Shepherd Criticize Language Model Outputs to Crash Hallucinations

In a new paper Shepherd: A Critic for Language Model Generation, a Meta AI research team presents Shepherd, a language model that are explicitly tuned to critique model generated outputs as well as to generate feedbacks to suggest improvements on solving the factuality, logical errors, coherence, and alignment issues.

by AI Insight Hub 2023-08-14 2

Futureverse’ Universal High-Quality Text-to-Music Generator JEN-1 Makes Significant Advancements

In a new paper JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models, a Futureverse research team presents JEN-1, a universal framework that combines bidirectional and unidirectional modes to generate high-quality music conditioned on either text or music representations.

by AI Insight Hub 2023-08-13 1

DeepMind’s AlphaStar Benchmark Improves RL Offline Agent With 90% Win Rate Against SOTA AlphaStar Supervised Agent

In a new paper AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning, a DeepMind research team presents AlphaStar Unplugged, an unprecedented challenging large-scale offline reinforcement learning benchmark that leverages a offline dataset from StarCraft II for RL agents training.

by AI Insight Hub 2023-08-10 1

Open-Source Large Autoregressive Vision-Language Models: Institutions Join Forces to Replicate DeepMind’s Flamingo Models

In a new paper OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models, a research team releases OpenFlamingo, an open-source replication of DeepMind’s Flamingo models for training autoregressive vision-language models.

by AI Insight Hub 2023-08-08 1

Microsoft Releases DeepSpeed-Chat for RLHF Training of ChatGPT-like Models

In a new paper DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales, a Deepspeed of Microsoft research team presents DeepSpeed-Chat, a novel end-to-end RLHF pipeline that provides easy-to-use training and inference for ChatGPT-like models at scale.

by AI Insight Hub 2023-08-07 1

DeepMind & Tokyo U’s WebAgent Realizes Real-World Web Navigation Following Natural Language Instructions

In a new paper A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis, a research team from Google DeepMind and The University of Tokyo presents WebAgent, a LLMs-driven real-world web navigation agent that can address real websites tasks following natural language instructions.

by AI Insight Hub 2023-08-06 3

New Study Unleashes The Power of Large Language Models to Master 16000+ Real World APIs

In a new paper ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs, a research team from Tsinghua University, ModelBest Inc., Renmin University of China, Yale University, Tencent Inc. and Zhihu Inc. presents ToolLLM, a general tool-use framework that demonstrates a compelling capability to master 16464 real-world RESTful APIs

TopList

Latest Posts