Dwarves
Memo
Home
Consulting
Handbook
Playbook
Hiring
Changelog
Contributor
Prompts
Night mode
Pinned
On agentic AI
2025 Roadmap
Experiment selection
Do one thing well
Compose newsletter
Our bets ๐งโโ๏ธ
Team profile ๐
Explore
Updates
Research
Consulting
Careers
Handbook
Playbook
Culture
Earn
Fund
Misc
Opensource
Org
Radar
Services
Resources
Dwarves
Memo
Search note
โ K
#learning
A
A grand unified theory of the AI hype cycle
AI
machine-learning
LLM
AI expertise & solutions
AI
LLM
machine-learning
Append-only concept embedding log
brainery
architecture
concept-embedding
C
Continuing education allowance
guide
handbook
learning
D
Design sprint
design
learning
UX
Digest
community
consulting
event
E
Explaining gradient descent in machine learning with a simple analogy
AI
machine-learning
LLM
Exploring machine learning approaches for fine tuning Llama models
machine-learning
LLM
engineering
I
Introduction to reinforcement learning and its application with LLMs
AI
reinforcement-learning
LLM
K
#25 Khoi Nguyen on continuous learning
backend-engineer
continuous-learning
life-at-dwarves
L
Learning with AI
culture
AI
learning
Learning chair
roadmap
learning
labs
M
Memo handbook
handbook
learning
memo
machine-learning
machine-learning
101
engineering
P
Proximal policy optimization
AI
reinforcement-learning
LLM
Q
Q learning
AI
machine-learning
LLM
R
Reward model
AI
reinforcement-learning
LLM
RLHF with Open Assistant
AI
reinforcement-learning
LLM