NLP Newsletter

Share this post

🥇Top ML Papers of the Week

nlpnews.substack.com

🥇Top ML Papers of the Week

The top ML Papers of the Week (Feb 6 - Feb 12)

elvis
Feb 12
7
Share this post

🥇Top ML Papers of the Week

nlpnews.substack.com

In this issue, we cover the top ML Papers of the Week (Feb 6 - Feb 12).


1). Toolformer - introduces language models that teach themselves to use external tools via simple API calls. (paper)

Twitter avatar for @timo_schick
Timo Schick @timo_schick
🎉 New paper 🎉 Introducing the Toolformer, a language model that teaches itself to use various tools in a self-supervised way. This significantly improves zero-shot performance and enables it to outperform much larger models. 🧰 🔗 Link: arxiv.org/abs/2302.04761
Image
2:51 PM ∙ Feb 10, 2023
1,178Likes240Retweets

2). Describe, Explain, Plan, and Select - proposes using language models for open-world game playing. (paper)

Twitter avatar for @jeasinema
Xiaojian Ma @jeasinema
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents Can ChatGPT help with open-world game playing, like Minecraft? It sure can! 🧵👇 arxiv.org/abs/2302.01560 github.com/CraftJarvis(code will be shipped soon)
Image
2:54 AM ∙ Feb 6, 2023
479Likes109Retweets

3). A Categorical Archive of ChatGPT Failures - a comprehensive analysis of ChatGPT failures for categories like reasoning, factual errors, maths, and coding. (paper)

Twitter avatar for @omarsar0
elvis @omarsar0
A Categorical Archive of ChatGPT Failures Comprehensive analysis of ChatGPT failures for categories like reasoning, factual errors, maths, and coding. If you are developing with LLMs it's important to know these failures. Good to see them documented. arxiv.org/abs/2302.03494 https://t.co/UYmdranQvr
Image
2:11 AM ∙ Feb 8, 2023
757Likes208Retweets

4). Hard Prompts Made Easy - optimizing hard text prompts through efficient gradient-based optimization. (paper)

Twitter avatar for @tomgoldsteincs
Tom Goldstein @tomgoldsteincs
We rack our brains making prompts for #StableDiffusion and Language Models. But a lot of prompt engineering can be done *automatically* using simple gradient-based optimization. And the cold calculating efficiency of the machine crushes human creativity.
Image
4:31 PM ∙ Feb 8, 2023
576Likes102Retweets

5). Data Selection for LMs - proposes a cheap and scalable data selection framework based on an importance resampling algorithm to improve the downstream performance of LMs. (paper)

Twitter avatar for @sangmichaelxie
Sang Michael Xie @sangmichaelxie
Data selection for LMs (GPT-3, PaLM) is done with heuristics that select data by training a classifier for high-quality text. Can we do better? Turns out we can boost downstream GLUE acc by 2+% by adapting the classic importance resampling algorithm.. arxiv.org/abs/2302.03169 🧵
Image
7:04 PM ∙ Feb 8, 2023
310Likes53Retweets

6). Gen-1 - proposes an approach for structure and content-guided video synthesis with diffusion models. (paper)

Twitter avatar for @AlphaSignalAI
Lior⚡ @AlphaSignalAI
Runway just released the paper behind their new Diffusion-based video generation tool! "Our model is trained on images+videos which exposes explicit control of temporal consistency through a novel guidance method." 📄: arxiv.org/abs/2302.03011 🛠️: research.runwayml.com/gen1 1/🧵
8:12 PM ∙ Feb 7, 2023
636Likes153Retweets

7). Multitask, Multilingual, Multimodal Evaluation of ChatGPT - performs a more rigorous evaluation of ChatGPt on reasoning, hallucination, and interactivity. (paper)

Twitter avatar for @omarsar0
elvis @omarsar0
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity Another new paper doing a more rigorous evaluation of ChatGPT on reasoning, hallucination, and interactivity. arxiv.org/abs/2302.04023 https://t.co/Vo9aAzYCVT
Image
Image
Image
2:10 AM ∙ Feb 9, 2023
258Likes56Retweets

8). Noise2Music - proposes diffusion models to generate high-quality 30-second music clips via text prompts. (paper)

Twitter avatar for @_akhaliq
AK @_akhaliq
Noise2Music: Text-conditioned Music Generation with Diffusion Models introduce Noise2Music, where a series of diffusion models is trained to generate high-quality 30-second music clips from text prompts abs: arxiv.org/abs/2302.03917 project page: google-research.github.io/noise2music/
1:40 AM ∙ Feb 9, 2023
293Likes71Retweets

9). Offsite-Tuning - introduces an efficient, privacy-preserving transfer learning framework to adapt foundational models to downstream data without access to the full model. (paper)

Twitter avatar for @arankomatsuzaki
Aran Komatsuzaki @arankomatsuzaki
Offsite-Tuning: Transfer Learning without Full Model Achieves comparable accuracy as full model fine-tuning while being privacy-preserving and efficient, gaining 6.5x speedup and 5.6x memory reduction. repo: github.com/mit-han-lab/of… abs: arxiv.org/abs/2302.04870
Image
2:53 PM ∙ Feb 11, 2023
321Likes53Retweets

10). pix2pix-zero - proposes a model for zero-shot image-to-image translation. (paper)

Twitter avatar for @arankomatsuzaki
Aran Komatsuzaki @arankomatsuzaki
Zero-shot Image-to-Image Translation Proposes pix2pix-zero, an image-to-image translation method that can preserve the content of the original image without manual prompting. proj: pix2pixzero.github.io abs: arxiv.org/abs/2302.03027
1:54 AM ∙ Feb 7, 2023
158Likes32Retweets

See you next week for another round of awesome ML papers!

Share this post

🥇Top ML Papers of the Week

nlpnews.substack.com
Comments
TopNewCommunity

No posts

Ready for more?

© 2023 elvis
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing