About
Interests: Machine Learning, Intelligent systems, Computational Linguistics, Information…
Activity
-
🚀 🚀 🚀 Excited to share that we brought Echo to life! From images or text, Echo generates real or imagined 3D worlds and enables fine-grained…
🚀 🚀 🚀 Excited to share that we brought Echo to life! From images or text, Echo generates real or imagined 3D worlds and enables fine-grained…
Liked by Edward Grefenstette
-
The Google DeepMind Autonomous Agents Team is looking for a research engineer to work, at least initially, on projects around building reward models…
The Google DeepMind Autonomous Agents Team is looking for a research engineer to work, at least initially, on projects around building reward models…
Shared by Edward Grefenstette
-
An update: I have left Meta Superintelligence Labs and joined Reflection AI in NYC!! Today is my first day. I started in the Fundamental AI Research…
An update: I have left Meta Superintelligence Labs and joined Reflection AI in NYC!! Today is my first day. I started in the Fundamental AI Research…
Liked by Edward Grefenstette
Experience
Education
-
University of Oxford
-
-
Activities and Societies: Balliol MCR Committee, Holywell Manor Bar Manager, Returning Officer in MCR Elections, Balliol Chapel Choir, Balliol Eights Rowing, Balliol MCR Cricket
Fully funded (tuition and stipend) by EPSRC Doctoral Training Account awarded on the basis of academic merit.
-
-
-
-
-
-
-
-
-
Publications
-
Lambek vs. Lambek: Functorial Vector Space Semantics and String Diagrams for Lambek Calculus
Annals of Pure and Applied Logic
The Distributional Compositional Categorical (DisCoCat) model is a mathematical framework that provides compositional semantics for meanings of natural language sentences. It consists of a computational procedure for constructing meanings of sentences, given their grammatical structure in terms of compositional type-logic, and given the empirically derived meanings of their words. For the particular case that the meaning of words is modelled within a distributional vector space model, its…
The Distributional Compositional Categorical (DisCoCat) model is a mathematical framework that provides compositional semantics for meanings of natural language sentences. It consists of a computational procedure for constructing meanings of sentences, given their grammatical structure in terms of compositional type-logic, and given the empirically derived meanings of their words. For the particular case that the meaning of words is modelled within a distributional vector space model, its experimental predictions, derived from real large scale data, have outperformed other empirically validated methods that could build vectors for a full sentence. This success can be attributed to a conceptually motivated mathematical underpinning, something which the other methods lack, by integrating qualitative compositional type-logic and quantitative modelling of meaning within a category-theoretic mathematical framework. The type-logic used in the DisCoCat model is Lambek's pregroup grammar. Pregroup types form a posetal compact closed category, which can be passed, in a functorial manner, on to the compact closed structure of vector spaces, linear maps and tensor product. The diagrammatic versions of the equational reasoning in compact closed categories can be interpreted as the flow of word meanings within sentences. Pregroups simplify Lambek's previous type-logic, the Lambek calculus. The latter and its extensions have been extensively used to formalise and reason about various linguistic phenomena. Hence, the apparent reliance of the DisCoCat on pregroups has been seen as a shortcoming. This paper addresses this concern, by pointing out that one may as well realise a functorial passage from the original type-logic of Lambek, a monoidal bi-closed category, to vector spaces, or to any other model of meaning organised within a monoidal bi-closed category. The corresponding string diagram calculus, due to Baez and Stay, now depicts the flow of word meanings.
Other authorsSee publication -
Multi−Step Regression Learning for Compositional Distributional Semantics
Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013)
We present a model for compositional distributional semantics related to the framework of Coecke et al. (2010), and emulating formal semantics by representing functions as tensors and arguments as vectors. We introduce a new learning method for tensors, generalising the approach of Baroni and Zamparelli (2010). We evaluate it on two benchmark data sets, and find it to outperform existing leading methods. We argue in our analysis that the nature of this learning method also renders it suitable…
We present a model for compositional distributional semantics related to the framework of Coecke et al. (2010), and emulating formal semantics by representing functions as tensors and arguments as vectors. We introduce a new learning method for tensors, generalising the approach of Baroni and Zamparelli (2010). We evaluate it on two benchmark data sets, and find it to outperform existing leading methods. We argue in our analysis that the nature of this learning method also renders it suitable for solving more subtle problems compositional distributional models might face.
Other authors -
-
Quantum Physics and Linguistics: A Compositional‚ Diagrammatic Discourse
Oxford University Press
New scientific paradigms typically consist of an expansion of the conceptual language with which we describe the world. Over the past decade, theoretical physics and quantum information theory have turned to category theory to model and reason about quantum protocols. This new use of categorical and algebraic tools allows a more conceptual and insightful expression of elementary events such as measurements, teleportation and entanglement operations, that were obscured in previous formalisms…
New scientific paradigms typically consist of an expansion of the conceptual language with which we describe the world. Over the past decade, theoretical physics and quantum information theory have turned to category theory to model and reason about quantum protocols. This new use of categorical and algebraic tools allows a more conceptual and insightful expression of elementary events such as measurements, teleportation and entanglement operations, that were obscured in previous formalisms. Recent work in natural language semantics has begun to use these categorical methods to relate grammatical analysis and semantic representations in a unified framework for analysing language meaning, and learning meaning from a corpus. A growing body of literature on the use of categorical methods in quantum information theory and computational linguistics shows both the need and opportunity for new research on the relation between these categorical methods and the abstract notion of information flow.
Other authorsSee publication -
Towards a Formal Distributional Semantics: Simulating Logical Calculi with Tensors
Proceedings of the Second Joint Conference on Lexical and Computational Semantics
See publicationThe development of compositional distributional models of semantics reconciling the empirical aspects of distributional semantics with the compositional aspects of formal semantics is a popular topic in the contemporary literature. This paper seeks to bring this reconciliation one step further by showing how the mathematical constructs commonly used in compositional distributional models, such as tensors and matrices, can be used to simulate different aspects of predicate logic. This paper…
The development of compositional distributional models of semantics reconciling the empirical aspects of distributional semantics with the compositional aspects of formal semantics is a popular topic in the contemporary literature. This paper seeks to bring this reconciliation one step further by showing how the mathematical constructs commonly used in compositional distributional models, such as tensors and matrices, can be used to simulate different aspects of predicate logic. This paper discusses how the canonical isomorphism between tensors and multilinear maps can be exploited to simulate a full-blown quantifier-free predicate calculus using tensors. It provides tensor interpretations of the set of logical connectives required to model propositional calculi. It suggests a variant of these tensor calculi capable of modelling quantifiers, using few non-linear operations. It finally discusses the relation between these variants, and how this relation should constitute the subject of future work.
-
A Compositional Distributional Semantics‚ Two Concrete Constructions‚ and some Experimental Evaluations
Lecture Notes in Computer Science
We provide an overview of the hybrid compositional distributional model of meaning, developed in Coecke et al. (arXiv:1003.4394v1 [cs.CL]), which is based on the categorical methods also applied to the analysis of information flow in quantum protocols. The mathematical setting stipulates that the meaning of a sentence is a linear function of the tensor products of the meanings of its words. We provide concrete constructions for this definition and present techniques to build vector spaces for…
We provide an overview of the hybrid compositional distributional model of meaning, developed in Coecke et al. (arXiv:1003.4394v1 [cs.CL]), which is based on the categorical methods also applied to the analysis of information flow in quantum protocols. The mathematical setting stipulates that the meaning of a sentence is a linear function of the tensor products of the meanings of its words. We provide concrete constructions for this definition and present techniques to build vector spaces for meaning vectors of words, as well as that of sentences. The applicability of these methods is demonstrated via a toy vector space as well as real data from the British National Corpus and two disambiguation experiments.
Other authors -
-
Concrete Sentence Spaces for Compositional Distributional Models of Meaning
Proceedings of the 9th International Conference on Computational Semantics (IWCS11)
Coecke, Sadrzadeh, and Clark (arXiv:1003.4394v1 [cs.CL]) developed a compositional model of meaning for distributional semantics, in which each word in a sentence has a meaning vector and the distributional meaning of the sentence is a function of the tensor products of the word vectors. Abstractly speaking, this function is the morphism corresponding to the grammatical structure of the sentence in the category of finite dimensional vector spaces. In this paper, we provide a concrete method for…
Coecke, Sadrzadeh, and Clark (arXiv:1003.4394v1 [cs.CL]) developed a compositional model of meaning for distributional semantics, in which each word in a sentence has a meaning vector and the distributional meaning of the sentence is a function of the tensor products of the word vectors. Abstractly speaking, this function is the morphism corresponding to the grammatical structure of the sentence in the category of finite dimensional vector spaces. In this paper, we provide a concrete method for implementing this linear meaning map, by constructing a corpus-based vector space for the type of sentence. Our construction method is based on structured vector spaces whereby meaning vectors of all sentences, regardless of their grammatical structure, live in the same vector space. Our proposed sentence space is the tensor product of two noun spaces, in which the basis vectors are pairs of words each augmented with a grammatical role. This enables us to compare meanings of sentences by simply taking the inner product of their vectors.
Other authorsSee publication -
Experimental Support for a Categorical Compositional Distributional Model of Meaning
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
Modelling compositional meaning for sentences using empirical distributional methods has been a challenge for computational linguists. We implement the abstract categorical model of Coecke et al. (2010) using data from the BNC and evaluate it. The implementation is based on unsupervised learning of matrices for relational words and applying them to the vectors of their arguments. The evaluation is based on the word disambiguation task developed by Mitchell and Lapata (2008) for intransitive…
Modelling compositional meaning for sentences using empirical distributional methods has been a challenge for computational linguists. We implement the abstract categorical model of Coecke et al. (2010) using data from the BNC and evaluate it. The implementation is based on unsupervised learning of matrices for relational words and applying them to the vectors of their arguments. The evaluation is based on the word disambiguation task developed by Mitchell and Lapata (2008) for intransitive sentences, and on a similar new experiment designed for transitive sentences. Our model matches the results of its competitors in the first experiment, and betters them in the second. The general improvement in results with increase in syntactic complexity showcases the compositional power of our model.
Other authors -
-
Experimenting with Transitive Verbs in a DisCoCat
Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics
Formal and distributional semantic models offer complementary benefits in modelling meaning. The categorical compositional distributional (DisCoCat) model of meaning of Coecke et al. (arXiv:1003.4394v1 [cs.CL]) combines aspected of both to provide a general framework in which meanings of words, obtained distributionally, are composed using methods from the logical setting to form sentence meaning. Concrete consequences of this general abstract setting and applications to empirical data are…
Formal and distributional semantic models offer complementary benefits in modelling meaning. The categorical compositional distributional (DisCoCat) model of meaning of Coecke et al. (arXiv:1003.4394v1 [cs.CL]) combines aspected of both to provide a general framework in which meanings of words, obtained distributionally, are composed using methods from the logical setting to form sentence meaning. Concrete consequences of this general abstract setting and applications to empirical data are under active study (Grefenstette et al., arxiv:1101.0309; Grefenstette and Sadrzadeh, arXiv:1106.4058v1 [cs.CL]). . In this paper, we extend this study by examining transitive verbs, represented as matrices in a DisCoCat. We discuss three ways of constructing such matrices, and evaluate each method in a disambiguation task developed by Grefenstette and Sadrzadeh (arXiv:1106.4058v1 [cs.CL]).
Other authors -
Honors & Awards
-
IJCAI-JAIR Best Paper Prize
The Journal of Artificial Intelligence Research (JAIR) and the International Joint Conference on Artificial Intelligence (IJCAI)
The Annual IJCAI-JAIR Best Paper Prize is awarded to an outstanding paper published in JAIR in the preceding five calendar years.
My former DeepMind colleague Richard Evans and I were awarded the 2021 prize for our 2018 work on Learning Explanatory Rules from Noisy Data. -
Best Long Paper Award at *SEM2013
SIGSEM/SIGLEX
Received for the paper "Towards a Formal Distributional Semantics: Simulating Logical Calculi with Tensors".
-
Westerman Pathfinders to the East
Balliol College, Oxford
Award of £4000 to go towards visiting alumnae of the college in East Asia.
-
EPSRC Enhanced Stipend
Engineering and Physical Sciences Research Council
The EPSRC provided enhanced stipend to select EPSRC scholars in the form of set bonuses paid at key waypoints of the doctorate.
-
Bishop Warner Bursary
Balliol College, Oxford
-
EPSRC Doctoral Scholarship
Engineering and Physical Sciences Research Council
The EPSRC Doctoral Training Account is a merit based scholarship which covered all university and provided a competitive stipend during the duration of my doctorate.
Languages
-
English
Native or bilingual proficiency
-
French
Native or bilingual proficiency
-
Japanese
Elementary proficiency
-
Italian
Elementary proficiency
More activity by Edward
-
The SEA workshop at NeurIPS 2025 is happening tomorrow. It seems we urgently need more open, realistic agent environments for training and evaluating…
The SEA workshop at NeurIPS 2025 is happening tomorrow. It seems we urgently need more open, realistic agent environments for training and evaluating…
Liked by Edward Grefenstette
-
Inspiring evening yesterday with Radical Ventures, listening to Jordan Jacobs moderate a fireside chat between Jeff Dean and Geoffrey Hinton. It is…
Inspiring evening yesterday with Radical Ventures, listening to Jordan Jacobs moderate a fireside chat between Jeff Dean and Geoffrey Hinton. It is…
Liked by Edward Grefenstette
-
The AAAI Award for Artificial Intelligence for the Benefit of Humanity recognizes the positive impacts of artificial intelligence to protect…
The AAAI Award for Artificial Intelligence for the Benefit of Humanity recognizes the positive impacts of artificial intelligence to protect…
Liked by Edward Grefenstette
-
We'll soon be launching CuspAI's King’s Cross Hub, led by Alessandro De Maria and Aron Walsh... We're looking for the brightest and most mission…
We'll soon be launching CuspAI's King’s Cross Hub, led by Alessandro De Maria and Aron Walsh... We're looking for the brightest and most mission…
Liked by Edward Grefenstette
-
Come work with us at Google DeepMind on the hot topic of Media Integrity, AI based deliberation, experiment with new and healthier ecosystems and the…
Come work with us at Google DeepMind on the hot topic of Media Integrity, AI based deliberation, experiment with new and healthier ecosystems and the…
Liked by Edward Grefenstette
-
Very excited that we can finally share some great foundational work Sian Gooding did, a while back. This has seeded and evolved into an exciting…
Very excited that we can finally share some great foundational work Sian Gooding did, a while back. This has seeded and evolved into an exciting…
Shared by Edward Grefenstette
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More