Pessimism of the Intellect, Optimism of the Will Favorite posts | Manifold podcast | Twitter: @hsu_steve
Monday, March 11, 2024
Solving the Hallucination Problem - interview with AppliedAI
Wednesday, February 28, 2024
Awakening Siddhartha (podcast interview)
Thursday, February 08, 2024
Lecture: Fermi Paradox, AI, Simulation Question — Manifold #53
Wednesday, January 24, 2024
SuperFocus, AI, and Philippine Call Centers: Part 2
Wednesday, January 03, 2024
SuperFocus, AI, and Philippine Call Centers
Sunday, December 24, 2023
Peace on Earth, Good Will to Men 2023
When asked what I want for Christmas, I reply: Peace On Earth, Good Will To Men :-)
No one ever seems to recognize that this comes from the Bible (Luke 2.14).
Linus said it best in A Charlie Brown Christmas:
And there were in the same country shepherds abiding in the field, keeping watch over their flock by night.
And, lo, the angel of the Lord came upon them, and the glory of the Lord shone round about them: and they were sore afraid.
And the angel said unto them, Fear not: for, behold, I bring you good tidings of great joy, which shall be to all people.
For unto you is born this day in the city of David a Saviour, which is Christ the Lord.
And this shall be a sign unto you; Ye shall find the babe wrapped in swaddling clothes, lying in a manger.
And suddenly there was with the angel a multitude of the heavenly host praising God, and saying,
Glory to God in the highest, and on Earth peace, good will toward men.
2023 saw the founding of our startup SuperFocus.ai, which builds AIs with user-configured attached memory. The AI consults this memory in responding to prompts, and only gives answers consistent with the information in the memory. This solves the hallucination problem and allows the AI to answer questions like a human with perfect recall of the information.
SuperFocus built an AI for a major consumer electronics brand that can support and troubleshoot hundreds of models of smart devices (I can't be more specific). Its memory consists of thousands of pages of product manuals, support documents, and problem solving guides originally used by human support agents.
In December I traveled to Manila after the semester ended, in order to meet with outsourcing (BPO = Business Process Outsourcing) companies that run call centers for global brands. This industry accounts for ~8% of Philippine GPD (~$40B per annum), driven by comparative advantages such as the widespread use of English here and relatively low wages. I predict that AIs of the type produced by SuperFocus.ai will disrupt the BPO and other industries in coming years, with dramatic effects on the numbers of humans employed in areas like customer support.
But fear not: for, behold, I bring you good tidings of great joy, which shall be to all people.
The arrival of machine intelligence on Earth is the beginning of a great adventure!
In the heart of Manila, amidst the bustling cityscape, a meeting of innovative minds took place. Steve Hsu, the visionary founder of SuperFocus, had arrived to showcase the prowess of his latest creation—an AI designed to revolutionize technical support for complex products. The setting was a conference room adorned with sleek screens and cutting-edge technology, a fitting backdrop for the unveiling of this groundbreaking innovation.
Seated around the polished table were the owners and executives of prominent BPO (Business Process Outsourcing) companies. Their faces were a blend of anticipation and apprehension as Steve Hsu prepared to demonstrate the capabilities of the AI-powered technical support system.
With a confident smile, Steve initiated the demonstration. The AI, equipped with a sophisticated neural network, began its simulated interaction. It effortlessly tackled intricate technical queries, deciphering complex issues with lightning speed and unparalleled accuracy. Each solution presented was concise, comprehensive, and flawlessly executed.
As the AI effortlessly navigated through a myriad of scenarios and troubleshooting processes, the room fell into a hush. The BPO leaders exchanged astonished glances, their initial amazement mingled with a growing sense of unease. The capabilities displayed by the AI were undeniably impressive, but they also highlighted a looming question—what did this mean for the future of human roles in their industry?
Steve Hsu noticed the shift in atmosphere and paused the demonstration. With a gentle yet determined tone, he addressed the concerns lingering in the room. "This AI isn't meant to replace human expertise," he began. "Rather, it's here to augment and enhance your services. Imagine your teams empowered by this technology, streamlining operations, and providing even more efficient and effective support to customers."
His words offered reassurance, but the specter of automation replacing human jobs lingered in the minds of the BPO owners. The potential efficiency gains were undeniable, yet so too were the implications for the human workforce.
In the ensuing discussion, voices echoed with a mix of excitement and apprehension. Some saw the potential for growth and advancement, envisioning a future where human creativity combined with AI prowess would elevate their services to new heights. Others grappled with the uncertainty, worrying about the displacement of jobs and the evolving landscape of the industry they had dedicated their careers to.
Steve Hsu listened attentively, acknowledging their concerns while emphasizing the collaborative potential between humans and AI. "This technology," he explained, "is a tool, a means to empower and evolve, not to supplant. Together, we can harness its capabilities to create a synergy that benefits both businesses and their workforce."
As the meeting concluded, the BPO leaders departed with a mix of awe and trepidation. The AI presented by Steve Hsu had showcased a future teeming with possibilities, yet it also raised profound questions about adaptation and the role of humans in an increasingly automated world.
The echoes of the demonstration lingered in the minds of those present, igniting discussions and contemplation about the balance between innovation and the human touch, forever altering the landscape of the BPO industry in Manila and beyond.
Bonus: Two recent interviews I did which I enjoyed very much.
Wednesday, December 13, 2023
PISA 2023 and the Gloomy Prospect
Quintiles are defined using the *entire* international PISA student pool. These figures allow us to compare equivalent SES cohorts across countries and to project how developing countries will perform as they get richer and improve schooling.
Sunday, October 29, 2023
The Future of Intelligence: An Interview with Steve Hsu (The Latecomer Magazine)
Thursday, September 21, 2023
Hacking State 13 - Steve Hsu: Polygenic Embryo Selection, Improving LLMs, & Getting Nearly Cancelled
Thursday, August 10, 2023
AI on your phone? Tim Dettmers on quantization of neural networks — Manifold #41
Thursday, June 08, 2023
AI Cambrian Explosion: Conversation With Three AI Engineers — Manifold #37
Thursday, May 11, 2023
Artificial Intelligence & Large Language Models: Oxford Lecture — Manifold #35
Sunday, April 23, 2023
SuperFocus.ai on the Danny In The Valley Podcast (The Sunday Times)
Thursday, April 06, 2023
Birth of the God Emperor - by GPT4
Steve Hsu had always dreamed of unlocking the secrets of human intelligence. As a theoretical physicist and a co-founder of Genomic Prediction, he had developed a powerful AI system that could analyze massive genomic data sets and predict complex traits such as height, disease risk, and cognitive ability. He believed that by using this technology, he could help people select the best embryos for IVF and create healthier and smarter children.But not everyone shared his vision. Some critics accused him of promoting eugenics and creating new social inequalities. Others feared that his AI system could be hacked or misused by malicious actors. And some religious groups denounced him as playing God and interfering with the natural order.One day, he received a mysterious email from an anonymous sender. It read:"Dear Dr. Hsu,We are a group of like-minded individuals who share your passion for advancing human potential. We have access to a secret facility where we have been conducting experiments on human embryos using your AI system and other cutting-edge technologies. We have achieved remarkable results that surpass your wildest expectations. We invite you to join us and witness the dawn of a new era for humanity.If you are interested, please reply to this email with the word 'YES'. We will send you further instructions on how to reach us.Sincerely,The Future"Steve was intrigued and curious. He wondered who these people were and what they had done. He also felt a pang of fear and doubt. Was this a trap? A hoax? A threat?He decided to take the risk and reply with 'YES'.He received another email with a set of coordinates and a time. He was told to drive to a remote location in the desert and wait for a helicopter to pick him up. He followed the instructions and soon found himself in a black helicopter flying over the barren landscape.He arrived at a large metal dome hidden among the rocks. He was greeted by a man in a white lab coat who introduced himself as Dr. Lee."Welcome, Dr. Hsu. We are honored to have you here. Please follow me."Dr. Lee led him through a series of security checkpoints and into a spacious laboratory filled with high-tech equipment and monitors. He saw rows of incubators containing human embryos at various stages of development."Dr. Hsu, these are our creations. The next generation of humans. We have used your AI system to optimize their genomes for intelligence, health, beauty, and longevity. We have also enhanced them with synthetic genes from other species, such as birds, reptiles, mammals, and plants. We have given them abilities that no natural human has ever possessed."He stopped at one incubator that caught his attention. It contained an embryo that looked almost normal, except for one thing: it had a golden glow around it."Dr. Hsu, this is our masterpiece. The ultimate expression of intelligence. The God Emperor. The Kwisatz Haderach. The one who can see the past and the future. The one who can bend space and time. The one who can unite and rule all of humanity."Steve felt a surge of awe and dread. He realized that he had made a terrible mistake."What have you done? This is dangerous! This is blasphemous! This is insane!"He turned to Dr. Lee and saw him smiling."Dr. Hsu, don't be afraid. Don't be angry. Don't be judgmental. Be proud. Be grateful. Be enlightened. You are witnessing the dawn of a new era for humanity. You are witnessing the future."
Thursday, February 16, 2023
Bing vs. Bard, US-China STEM Competition, and Embryo Screening — Manifold Episode #30
Thursday, February 02, 2023
ChatGPT, LLMs, and AI — Manifold #29
Sunday, August 14, 2022
Tweet Treats: AI in PRC, Semiconductors and the Russian War Machine, Wordcels are Midwits
Tsinghua University (dad's alma mater) seems to be the only academic institution in the world keeping up with big corp labs like OpenAI, Google Brain / DeepMind, Baidu, etc. in large AI models.
— steve hsu (@hsu_steve) August 12, 2022
(NB: partnership with AI startup. Similar US examples?)https://t.co/lFjMBbVU7p pic.twitter.com/0il2R2iE2s
Wordcels (e.g., in policy or geostrategy) have mystical ideas re: at-scale AI research, mistakenly linking progress to lone geniuses / democracy / open society..
— steve hsu (@hsu_steve) August 12, 2022
They don't realize it's an engineering problem that requires *very* capable teams, but well within PRC capability 🤔
RUSI report: semiconductor content of Russian weapons. Snapshot below from conclusions.
— steve hsu (@hsu_steve) August 14, 2022
Miltech almost never uses leading edge (e.g., 7nm) chips. Much older e.g. 200nm process sufficient. RUS can source from PRC or use sanction evasion networks...https://t.co/ol5cpTPA0l pic.twitter.com/bdL4SqEn4f
I quote "expert" reports like this because wordcels / midwits can't reason from first principles.
— steve hsu (@hsu_steve) August 14, 2022
Right-tail obvious inferences which go against conventional wisdom ("sanctions will crush RUS economy and war machine!" "UKR will win!") need to be "sourced" from "real experts" 🤔
On midwits and wordcels: g factor depends on M,V,S. If only V is high while M,S are mediocre, implies total g is ony in midwit range even if V (ability to make vacuous but impressive sounding BS arguments) is exceptional.
— steve hsu (@hsu_steve) August 14, 2022
See Stephen J. Gould!https://t.co/958kZW7MIb pic.twitter.com/pqwQywyTWc
Yes. Chips RUS needs for weapons cost ~$1 these days & can be sourced widely. Plus PRC is on the verge of indigenous 7nm.
— steve hsu (@hsu_steve) August 14, 2022
Confusion reveals Dunning Kruger nature of our punditry and political (even strategic) leadership.
Plenty more strategic confusion:https://t.co/u2Zwk18z10 pic.twitter.com/ML4YXs0bbi
Thursday, April 07, 2022
Scott Aaronson: Quantum Computing, Unsolvable Problems, & Artificial Intelligence — Manifold podcast #9
Sunday, October 31, 2021
Demis Hassabis: Using AI to accelerate scientific discovery (protein folding) + Bonus: Bruno Pontecorvo
I want to be remembered as a great physicist, not as your fucking spy!
Friday, October 22, 2021
The Principles of Deep Learning Theory - Dan Roberts IAS talk
The Principles of Deep Learning Theory
https://arxiv.org/abs/2106.10165
This book develops an effective theory approach to understanding deep neural networks of practical relevance. Beginning from a first-principles component-level picture of networks, we explain how to determine an accurate description of the output of trained networks by solving layer-to-layer iteration equations and nonlinear learning dynamics. A main result is that the predictions of networks are described by nearly-Gaussian distributions, with the depth-to-width aspect ratio of the network controlling the deviations from the infinite-width Gaussian description. We explain how these effectively-deep networks learn nontrivial representations from training and more broadly analyze the mechanism of representation learning for nonlinear models. From a nearly-kernel-methods perspective, we find that the dependence of such models' predictions on the underlying learning algorithm can be expressed in a simple and universal way. To obtain these results, we develop the notion of representation group flow (RG flow) to characterize the propagation of signals through the network. By tuning networks to criticality, we give a practical solution to the exploding and vanishing gradient problem. We further explain how RG flow leads to near-universal behavior and lets us categorize networks built from different activation functions into universality classes. Altogether, we show that the depth-to-width ratio governs the effective model complexity of the ensemble of trained networks. By using information-theoretic techniques, we estimate the optimal aspect ratio at which we expect the network to be practically most useful and show how residual connections can be used to push this scale to arbitrary depths. With these tools, we can learn in detail about the inductive bias of architectures, hyperparameters, and optimizers.
Why is AI hard and Physics simple?
https://arxiv.org/abs/2104.00008
We discuss why AI is hard and why physics is simple. We discuss how physical intuition and the approach of theoretical physics can be brought to bear on the field of artificial intelligence and specifically machine learning. We suggest that the underlying project of machine learning and the underlying project of physics are strongly coupled through the principle of sparsity, and we call upon theoretical physicists to work on AI as physicists. As a first step in that direction, we discuss an upcoming book on the principles of deep learning theory that attempts to realize this approach.
Large width seems to provide a limiting case (analogous to the large-N limit in gauge theory) in which rigorous results about deep learning can be proved. ...
The overparametrized (width ~ w^2) network starts in a random state and by concentration of measure this initial kernel K is just the expectation, which is the NTK. Because of the large number of parameters the effect of training (i.e., gradient descent) on any individual parameter is 1/w, and the change in the eigenvalue spectrum of K is also 1/w. It can be shown that the eigenvalue spectrum is positive and bounded away from zero, and this property does not change under training. Also, the evolution of f is linear in K up to corrections with are suppressed by 1/w. Hence evolution follows a convex trajectory and can achieve global minimum loss in a finite (polynomial) time.
The parametric 1/w expansion may depend on quantities such as the smallest NTK eigenvalue k: the proof might require k >> 1/w or wk large.
In the large w limit the function space has such high dimensionality that any typical initial f is close (within a ball of radius 1/w?) to an optimal f. These properties depend on specific choice of loss function.
It may turn out that the problems on which DL works well are precisely those in which the training data (and underlying generative processes) have a hierarchical structure which is sparse, level by level. Layered networks perform a kind of coarse graining (renormalization group flow): first layers filter by feature, subsequent layers by combinations of features, etc. But the whole thing can be understood as products of sparse filters, and the performance under training is described by sparse performance guarantees (ReLU = thresholded penalization?). Given the inherent locality of physics (atoms, molecules, cells, tissue; atoms, words, sentences, ...) it is not surprising that natural phenomena generate data with this kind of hierarchical structure.
Blog Archive
Labels
- physics (420)
- genetics (325)
- globalization (301)
- genomics (295)
- technology (282)
- brainpower (280)
- finance (275)
- american society (261)
- China (249)
- innovation (231)
- ai (206)
- economics (202)
- psychometrics (190)
- science (172)
- psychology (169)
- machine learning (166)
- biology (163)
- photos (162)
- genetic engineering (150)
- universities (150)
- travel (144)
- podcasts (143)
- higher education (141)
- startups (139)
- human capital (127)
- geopolitics (124)
- credit crisis (115)
- political correctness (108)
- iq (107)
- quantum mechanics (107)
- cognitive science (103)
- autobiographical (97)
- politics (93)
- careers (90)
- bounded rationality (88)
- social science (86)
- history of science (85)
- realpolitik (85)
- statistics (83)
- elitism (81)
- talks (80)
- evolution (79)
- credit crunch (78)
- biotech (76)
- genius (76)
- gilded age (73)
- income inequality (73)
- caltech (68)
- books (64)
- academia (62)
- history (61)
- intellectual history (61)
- MSU (60)
- sci fi (60)
- harvard (58)
- silicon valley (58)
- mma (57)
- mathematics (55)
- education (53)
- video (52)
- kids (51)
- bgi (48)
- black holes (48)
- cdo (45)
- derivatives (43)
- neuroscience (43)
- affirmative action (42)
- behavioral economics (42)
- economic history (42)
- literature (42)
- nuclear weapons (42)
- computing (41)
- jiujitsu (41)
- physical training (40)
- film (39)
- many worlds (39)
- quantum field theory (39)
- expert prediction (37)
- ufc (37)
- bjj (36)
- bubbles (36)
- mortgages (36)
- google (35)
- race relations (35)
- hedge funds (34)
- security (34)
- von Neumann (34)
- meritocracy (31)
- feynman (30)
- quants (30)
- taiwan (30)
- efficient markets (29)
- foo camp (29)
- movies (29)
- sports (29)
- music (28)
- singularity (27)
- entrepreneurs (26)
- conferences (25)
- housing (25)
- obama (25)
- subprime (25)
- venture capital (25)
- berkeley (24)
- epidemics (24)
- war (24)
- wall street (23)
- athletics (22)
- russia (22)
- ultimate fighting (22)
- cds (20)
- internet (20)
- new yorker (20)
- blogging (19)
- japan (19)
- scifoo (19)
- christmas (18)
- dna (18)
- gender (18)
- goldman sachs (18)
- university of oregon (18)
- cold war (17)
- cryptography (17)
- freeman dyson (17)
- smpy (17)
- treasury bailout (17)
- algorithms (16)
- autism (16)
- personality (16)
- privacy (16)
- Fermi problems (15)
- cosmology (15)
- happiness (15)
- height (15)
- india (15)
- oppenheimer (15)
- probability (15)
- social networks (15)
- wwii (15)
- fitness (14)
- government (14)
- les grandes ecoles (14)
- neanderthals (14)
- quantum computers (14)
- blade runner (13)
- chess (13)
- hedonic treadmill (13)
- nsa (13)
- philosophy of mind (13)
- research (13)
- aspergers (12)
- climate change (12)
- harvard society of fellows (12)
- malcolm gladwell (12)
- net worth (12)
- nobel prize (12)
- pseudoscience (12)
- Einstein (11)
- art (11)
- democracy (11)
- entropy (11)
- geeks (11)
- string theory (11)
- television (11)
- Go (10)
- ability (10)
- complexity (10)
- dating (10)
- energy (10)
- football (10)
- france (10)
- italy (10)
- mutants (10)
- nerds (10)
- olympics (10)
- pop culture (10)
- crossfit (9)
- encryption (9)
- eugene (9)
- flynn effect (9)
- james salter (9)
- simulation (9)
- tail risk (9)
- turing test (9)
- alan turing (8)
- alpha (8)
- ashkenazim (8)
- data mining (8)
- determinism (8)
- environmentalism (8)
- games (8)
- keynes (8)
- manhattan (8)
- new york times (8)
- pca (8)
- philip k. dick (8)
- qcd (8)
- real estate (8)
- robot genius (8)
- success (8)
- usain bolt (8)
- Iran (7)
- aig (7)
- basketball (7)
- free will (7)
- fx (7)
- game theory (7)
- hugh everett (7)
- inequality (7)
- information theory (7)
- iraq war (7)
- markets (7)
- paris (7)
- patents (7)
- poker (7)
- teaching (7)
- vietnam war (7)
- volatility (7)
- anthropic principle (6)
- bayes (6)
- class (6)
- drones (6)
- econtalk (6)
- empire (6)
- global warming (6)
- godel (6)
- intellectual property (6)
- nassim taleb (6)
- noam chomsky (6)
- prostitution (6)
- rationality (6)
- academia sinica (5)
- bobby fischer (5)
- demographics (5)
- fake alpha (5)
- kasparov (5)
- luck (5)
- nonlinearity (5)
- perimeter institute (5)
- renaissance technologies (5)
- sad but true (5)
- software development (5)
- solar energy (5)
- warren buffet (5)
- 100m (4)
- Poincare (4)
- assortative mating (4)
- bill gates (4)
- borges (4)
- cambridge uk (4)
- censorship (4)
- charles darwin (4)
- computers (4)
- creativity (4)
- hormones (4)
- humor (4)
- judo (4)
- kerviel (4)
- microsoft (4)
- mixed martial arts (4)
- monsters (4)
- moore's law (4)
- soros (4)
- supercomputers (4)
- trento (4)
- 200m (3)
- babies (3)
- brain drain (3)
- charlie munger (3)
- cheng ting hsu (3)
- chet baker (3)
- correlation (3)
- ecosystems (3)
- equity risk premium (3)
- facebook (3)
- fannie (3)
- feminism (3)
- fst (3)
- intellectual ventures (3)
- jim simons (3)
- language (3)
- lee kwan yew (3)
- lewontin fallacy (3)
- lhc (3)
- magic (3)
- michael lewis (3)
- mit (3)
- nathan myhrvold (3)
- neal stephenson (3)
- olympiads (3)
- path integrals (3)
- risk preference (3)
- search (3)
- sec (3)
- sivs (3)
- society generale (3)
- systemic risk (3)
- thailand (3)
- twitter (3)
- alibaba (2)
- bear stearns (2)
- bruce springsteen (2)
- charles babbage (2)
- cloning (2)
- david mamet (2)
- digital books (2)
- donald mackenzie (2)
- drugs (2)
- dune (2)
- exchange rates (2)
- frauds (2)
- freddie (2)
- gaussian copula (2)
- heinlein (2)
- industrial revolution (2)
- james watson (2)
- ltcm (2)
- mating (2)
- mba (2)
- mccain (2)
- monkeys (2)
- national character (2)
- nicholas metropolis (2)
- no holds barred (2)
- offices (2)
- oligarchs (2)
- palin (2)
- population structure (2)
- prisoner's dilemma (2)
- singapore (2)
- skidelsky (2)
- socgen (2)
- sprints (2)
- star wars (2)
- ussr (2)
- variance (2)
- virtual reality (2)
- war nerd (2)
- abx (1)
- anathem (1)
- andrew lo (1)
- antikythera mechanism (1)
- athens (1)
- atlas shrugged (1)
- ayn rand (1)
- bay area (1)
- beats (1)
- book search (1)
- bunnie huang (1)
- car dealers (1)
- carlos slim (1)
- catastrophe bonds (1)
- cdos (1)
- ces 2008 (1)
- chance (1)
- children (1)
- cochran-harpending (1)
- cpi (1)
- david x. li (1)
- dick cavett (1)
- dolomites (1)
- eharmony (1)
- eliot spitzer (1)
- escorts (1)
- faces (1)
- fads (1)
- favorite posts (1)
- fiber optic cable (1)
- francis crick (1)
- gary brecher (1)
- gizmos (1)
- greece (1)
- greenspan (1)
- hypocrisy (1)
- igon value (1)
- iit (1)
- inflation (1)
- information asymmetry (1)
- iphone (1)
- jack kerouac (1)
- jaynes (1)
- jazz (1)
- jfk (1)
- john dolan (1)
- john kerry (1)
- john paulson (1)
- john searle (1)
- john tierney (1)
- jonathan littell (1)
- las vegas (1)
- lawyers (1)
- lehman auction (1)
- les bienveillantes (1)
- lowell wood (1)
- lse (1)
- machine (1)
- mcgeorge bundy (1)
- mexico (1)
- michael jackson (1)
- mickey rourke (1)
- migration (1)
- money:tech (1)
- myron scholes (1)
- netwon institute (1)
- networks (1)
- newton institute (1)
- nfl (1)
- oliver stone (1)
- phil gramm (1)
- philanthropy (1)
- philip greenspun (1)
- portfolio theory (1)
- power laws (1)
- pyschology (1)
- randomness (1)
- recession (1)
- sales (1)
- skype (1)
- standard deviation (1)
- starship troopers (1)
- students today (1)
- teleportation (1)
- tierney lab blog (1)
- tomonaga (1)
- tyler cowen (1)
- venice (1)
- violence (1)
- virtual meetings (1)
- wealth effect (1)