SUBTLEX-NL: A new measure for Dutch word frequency based on film subtitles

  • Published: August 2010
  • Behavior Research Methods, Volume 42, pages 643–650 (2010)

  • Emmanuel Keuleers (2),
  • Marc Brysbaert (2) &
  • Boris New (1)

Abstract

We present a new database of Dutch word frequencies based on film and television subtitles, and we validate it with a lexical decision study involving 14,000 monosyllabic and disyllabic Dutch words. The new SUBTLEX frequencies explain up to 10% more variance in accuracies and reaction times (RTs) of the lexical decision task than the existing CELEX word frequency norms, which are based largely on edited texts. As is the case for English, an accessibility measure based on contextual diversity explains more of the variance in accuracy and RT than does the raw frequency of occurrence counts. The database is freely available for research purposes and may be downloaded from the authors’ university site at http://crr.ugent.be/subtlex-nl or from http://brm.psychonomic-journals.org/content/supplemental.
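
The distinction between raw frequency and contextual diversity can be illustrated with a minimal sketch. It assumes that contextual diversity is the proportion of subtitle files in which a word occurs at least once and that frequency is expressed per million tokens; the abstract does not spell out either definition, and the function name `frequency_and_diversity` and the toy corpus are hypothetical, not taken from the SUBTLEX-NL database.

```python
from collections import Counter


def frequency_and_diversity(documents):
    """Return (per-million frequency, contextual diversity) per word.

    documents: list of token lists, one list per subtitle file.
    Contextual diversity is assumed here to be the share of files
    that contain the word at least once.
    """
    token_counts = Counter()  # total occurrences of each word
    doc_counts = Counter()    # number of files containing each word
    for tokens in documents:
        token_counts.update(tokens)
        doc_counts.update(set(tokens))  # count each word once per file
    total_tokens = sum(token_counts.values())
    per_million = {
        word: count * 1_000_000 / total_tokens
        for word, count in token_counts.items()
    }
    diversity = {
        word: doc_counts[word] / len(documents) for word in token_counts
    }
    return per_million, diversity


# Toy corpus of three "subtitle files" (illustrative data only).
docs = [
    ["de", "kat", "de", "hond"],
    ["de", "kat"],
    ["hond", "hond", "hond", "de"],
]
freq, cd = frequency_and_diversity(docs)
```

In this toy corpus "hond" is twice as frequent as "kat", yet both occur in two of the three files, so their contextual diversity is identical. The two measures can therefore rank words differently.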

Author information

Authors and Affiliations

  1. Université Paris Descartes and CNRS UMR 8189, Paris, France

    Boris New

  2. Department of Experimental Psychology, Ghent University, Henri Dunantlaan 2, B-9000, Ghent, Belgium

    Emmanuel Keuleers & Marc Brysbaert

Corresponding author

Correspondence to Emmanuel Keuleers.

Electronic supplementary material

Supplementary material, approximately 340 KB.

About this article

Cite this article

Keuleers, E., Brysbaert, M. & New, B. SUBTLEX-NL: A new measure for Dutch word frequency based on film subtitles. Behavior Research Methods 42, 643–650 (2010). https://doi.org/10.3758/BRM.42.3.643

  • Received: 12 August 2009

  • Accepted: 27 March 2010

  • Issue date: August 2010

  • DOI: https://doi.org/10.3758/BRM.42.3.643

Keywords

  • Lexical Decision
  • Word Frequency
  • Contextual Diversity
  • Word Form
  • Lexical Decision Time