Research¶
The page is dedicated to collecting all research that was collected in the past one year from various sources.
This is not an exhaustive list, and any PRs would be welcome
Research Papers¶
- [2024/06/04][Symbolic reasoning](https://arxiv.org/abs/2402.01817)
- [2024/06/04][Transformers and episodic memory](https://arxiv.org/abs/2405.14992)
- [2024/03/24][Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs](https://arxiv.org/abs/2404.07103)
- [2024/03/24][Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention](https://arxiv.org/abs/2404.07143)
- [2024/03/24][Compound AI systems](https://bair.berkeley.edu/blog/2024/02/18/compound-ai-systems/)
- [2015/07/30][Multilayer Network of Language](https://arxiv.org/abs/1507.08539)
- [2023/12/12] Dense X Retrieval: What Retrieval Granularity Should We Use?
- [2024/01/05][Retrieval-Augmented Generation for Large Language Models: A Survey](https://arxiv.org/pdf/2312.10997.pdf)
- [2022/10/20] Cognitive modelling with multilayer networks: Insights, advancements and future challenges
- [2023/09/20] CoAla framework and relevant literature literature
- [2023/06/09][Mind2Web: Towards a Generalist Agent for the Web](https://arxiv.org/pdf/2306.06070.pdf), Xiang Deng, et al. [code] [demo]
- [2023/06/28] AI Agents in Langchain https://docs.google.com/presentation/d/1L_CHsg26sDxPmKj285Ob5T2xsAUejBlfiGQSnsSHTk0/edit#slide=id.g254e571859c_0_164
- [2023/06/27] Agent infra https://lilianweng.github.io/posts/2023-06-23-agent/
- [2023/06/05][Orca: Progressive Learning from Complex Explanation Traces of GPT-4](https://arxiv.org/pdf/2306.02707.pdf), Subhabrata Mukherjee et al.
- [2023/05/25] 📚Voyager: An Open-Ended Embodied Agent with Large Language Models, Guanzhi Wang, et al. [code] [website], Shishir G. Patil, et al.
- [2023/05/24] 📚Gorilla: Gorilla: Large Language Model Connected with Massive APIs
- [2023/05/17] 📚Tree of Thoughts: Deliberate Problem Solving with Large Language Models, Shunyu Yao, et al.[code] [code-orig]
- [2023/05/12] 📚MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers, Lili Yu, et al.
- [2023/05/09] 📚FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance, Lingjiao Chen, et al.
- [2023/05/01] 📚Learning to Reason and Memorize with Self-Notes, Jack Lanchantin, et al.
- [2023/04/24] 📚WizardLM: Empowering Large Language Models to Follow Complex Instructions, Can Xu, et al.
- [2023/04/22] 📚LLM+P: Empowering Large Language Models with Optimal Planning Proficiency, Bo Liu, et al.
- [2023/04/07] 📚Generative Agents: Interactive Simulacra of Human Behavior, Joon Sung Park, et al. [code]
- [2023/03/30][Self-Refine: Iterative Refinement with Self-Feedback](https://arxiv.org/abs/2303.17651), Aman Madaan, et al.[code]
- [2023/03/30][HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace](https://arxiv.org/pdf/2303.17580.pdf), Yongliang Shen, et al. [code] [demo]
- [2023/03/20][Reflexion: Language Agents with Verbal Reinforcement Learning](https://arxiv.org/pdf/2303.11366.pdf), Noah Shinn , et al. [code]
- [2023/02/23] 📚Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection, Sahar Abdelnab, et al.
- [2023/02/09] 📚Toolformer: Language Models Can Teach Themselves to Use Tools, Timo Schick, et al. [code]
- [2022/12/12] 📚LMQL: Prompting Is Programming: A Query Language for Large Language Models, Luca Beurer-Kellner, et al.
- [2022/10/06][ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/pdf/2210.03629.pdf), Shunyu Yao, et al. [code]
- [2022/07/12] 📚Inner Monologue: Embodied Reasoning through Planning with Language Models, Wenlong Huang, et al. [demo]
- [2022/04/04][Do As I Can, Not As I Say: Grounding Language in Robotic Affordances](https://github.com/Significant-Gravitas/Nexus/wiki/Awesome-Resources), Michael Ahn, e al. [demo]
- [2021/12/17][WebGPT: Browser-assisted question-answering with human feedback](https://arxiv.org/pdf/2112.09332.pdf), Reiichiro Nakano, et al.
- [2021/06/17] 📚LoRA: Low-Rank Adaptation of Large Language Models, Edward J. Hu, et al.
- [2023/04/03][Generative Agents](https://arxiv.org/abs/2304.03442)
- [2023/05/17][Three of thought: Deliberate Problem Solving with Large Language Mode](https://arxiv.org/abs/2305.10601)ls
Knowledge Graphs¶
- [2023/06/09][Taxonomies: Overview](https://www.brighttalk.com/webcast/9273/605659?utm_source=brighttalk-portal&utm_medium=web&utm_campaign=topic&utm_content=upcoming)
Blog Articles¶
- [2023/04/29][AUTO-GPT: UNLEASHING THE POWER OF AUTONOMOUS AI AGENTS](https://www.leewayhertz.com/autogpt/) By Akash Takyar
- [2023/04/20][Conscious Machines: Experiments, Theory, and Implementations(Chinese)](https://pattern.swarma.org/article/230) By Jiang Zhang
- [2023/04/18][Autonomous Agents & Agent Simulations](https://blog.langchain.dev/agents-round/) By Langchain
- [2023/04/16][4 Autonomous AI Agents you need to know](https://towardsdatascience.com/4-autonomous-ai-agents-you-need-to-know-d612a643fa92) By Sophia Yang
- [2023/03/31][ChatGPT that learns to use tools](https://zhuanlan.zhihu.com/p/618448188) By Haojie Pan
Talks¶
- [2023/06/05][Two Paths to Intelligence](https://www.youtube.com/watch?v=rGgGOccMEiY&t=1497s) by Geoffrey Hinton
- [2023/05/24][State of GPT](https://www.youtube.com/watch?v=bZQun8Y4L2A) by Andrej Karpathy | OpenAI
- [2024/03/15] Podcast on AI, Memory by Bill Gurley