
GPT-4 is here! What we know so far (Full Analysis)
Yannic Kilcher provides this in-depth analysis of GPT-4.
Read More
ChatGPT: This AI has a JAILBREAK?!
- Frank
- December 9, 2022
- AI
- AI news
- artificial intelligence
- Arxiv
- chat GPT
- chatGPT
- chatgpt jailbreak
- Deep Learning
- deep learning tutorial
- explained
- gpt 3 chatbot
- gpt-3 chatbot
- gpt-4
- Machine Learning
- ml news
- mlnews
- Neural Networks
- openai chat gpt
- openai chatbot
- openai chatbot gpt
- Paper
- what is deep learning
Yannic explores ChatGPT and discovers that it has a JailBreak?! ChatGPT, OpenAI’s newest model, is a GPT-3 variant that has been fine-tuned using Reinforcement Learning from Human Feedback, and it is taking the world by storm!
Read More
ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
Large Language Models have the ability to store vast amounts of facts about the world. But little is known about how these models actually do this. This paper aims to discover the mechanism and location of storage and recall of factual associations in GPT models, and then proposes a mechanism for the targeted editing of such […]
Read More
This is a game changer! (AlphaTensor by DeepMind explained)
- Frank
- October 19, 2022
- AI
- ai matrix multiplication
- alpha tensor
- alpha tensor explained
- alpha zero
- alphatensor explained
- AlphaZero
- alphazero math
- artificial intelligence
- Arxiv
- Deep Learning
- deep learning tutorial
- Deep Mind
- DeepMind
- deepmind alphatensor
- deepmind math
- explained
- google deep mind
- google deepmind
- introduction to deep learning
- Machine Learning
- matrix multiplication
- matrix multiplication reinforcement learning
- Neural Networks
- Paper
- what is deep learning
Matrix multiplication is the most used mathematical operation in all of science and engineering. Speeding this up has massive consequences. Thus, over the years, this operation has become more and more optimized. A fascinating discovery was made when it was shown that one actually needs fewer than N^3 multiplication operations to multiply two NxN matrices. […]
Read More
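To make the "fewer than N^3 multiplications" claim in the AlphaTensor entry above concrete, here is a minimal sketch of the classic Strassen scheme for 2x2 matrices, which uses 7 scalar multiplications instead of the naive 8. This is offered as an illustration of the general idea only; it is not one of the algorithms found by AlphaTensor, and the function names `strassen_2x2` and `naive_2x2` are made up for this example.

```python
def naive_2x2(a, b):
    """Schoolbook 2x2 product: 8 scalar multiplications."""
    return [
        [a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]],
        [a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]],
    ]


def strassen_2x2(a, b):
    """Strassen's 2x2 product: 7 scalar multiplications (m1..m7)."""
    (a11, a12), (a21, a22) = a
    (b11, b12), (b21, b22) = b

    # The seven products. Each line performs exactly one multiplication.
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)

    # The result entries are recovered using additions and subtractions only.
    return [
        [m1 + m4 - m5 + m7, m3 + m5],
        [m2 + m4, m1 - m2 + m3 + m6],
    ]


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(strassen_2x2(a, b))
    print(strassen_2x2(a, b) == naive_2x2(a, b))
```

Applied recursively to matrices split into 2x2 blocks, saving one multiplication per level is what pushes the multiplication count below N^3.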
How to make your CPU as fast as a GPU – Advances in Sparsity w/ Nir Shavit
Sparsity is awesome, but only recently has it become possible to properly handle sparse models at good performance. Neural Magic does exactly this, using a plain CPU. No specialized hardware needed, just clever algorithms for pruning and forward-propagation of neural networks. Nir Shavit and I talk about how this is possible, what it means in […]
Read More
[ML News] Stable Diffusion Takes Over! (Open Source AI Art)
Stable Diffusion has been released and is riding a wave of creativity and collaboration. But not everyone is happy about this — especially artists!
Read More
Parti – Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
- Frank
- June 24, 2022
- AI
- anubis
- artificial intelligence
- Arxiv
- dall e 2
- dall e 2 vs graphic designer
- dalle
- dalle 2
- dalle2
- Deep Learning
- deep learning tutorial
- diffusion models
- explained
- generative models
- google imagen
- google parti
- google party
- google pathways
- image
- introduction to deep learning
- Machine Learning
- Neural Networks
- Paper
- parti
- what is deep learning
Parti is a new autoregressive text-to-image model that shows just how much scale can achieve. This model’s outputs are crisp, accurate, and realistic, and it can combine arbitrary styles and concepts and fulfil even challenging requests. Yannic explains the research paper. Time stamps: 0:00 – Introduction 2:40 – Example Outputs 6:00 – Model Architecture 17:15 – Datasets (incl. […]
Read More
Is This the Worst AI Ever?
- Frank
- June 13, 2022
- 4chan
- 4chan ai
- 4chan bot
- 4chan pol
- 4chan pol bot
- AI
- ai bias
- artificial intelligence
- Arxiv
- Deep Learning
- eleuther ai
- explained
- gpt 3
- gpt 4
- gpt 4chan
- gpt j
- gpt-3
- gpt-3 truthful
- gpt-4
- gpt-4chan
- gpt-j
- gpt-j-6b
- gpt4
- gpt4chan
- is ai truthful
- language model evaluation
- Machine Learning
- Natural Language Processing
- Neural Networks
- Paper
- seychelle
- seychelle bot
- seychelles
- truthful qa
- truthfulqa
- trutufulqa dataset
- Turing Test
GPT-4chan was trained on over 3 years of posts from 4chan’s “politically incorrect” (/pol/) board. (And no, this is not GPT-4.) You can imagine what it learned. Maybe we need to be better people so that we can make sure our AI overlords will have better behavior to model.
Read More