An edited and accessible (formatted, numbered, searchable) edition of Matthias Buchmheier’s FREQUENCY LISTING OF ITALIAN WORD-FORMS
(http://en.wiktionary.org/wiki/User:Matthias_Buchmeier: This list can be used under the terms of the cc-by-sa, GFDL, or LGPL licenses.)
Edited by Robert B. Youngblood
Matthias Buchmeier’s listing is an electronic transfer based on 5,765,191 Italian words of sub-titled dialog of international feature films and television series for Italian audiences.
Buchmeier’s frequency listing comprises just under 75,000 words. It is the most complete, discrete frequency list known to the editor — a huge repository. In its online layout form, however, it is a raw listing, reproducing word after word, line after line across web pages of 5,000 words apiece.
Buchmeier’s list reproduces the individual form of each spoken word — i.e., each form of the declinable parts of speech, the various determiner forms, individual conjugated verb forms, personal pronouns, verb forms plus pronouns (e.g., “farlo, smettila, eccoli, dagli, darle, esserci, vattene”), for instance, both singular and plural noun forms, individual 2- and 4-form adjectives, compound prepositions plus articles — each form by descending frequency down to two-occurrence words.
Buchmeier’s list is impossible to search. The editor of this listing has transformed the raw listing into a clearly formatted, numerical and searchable listing.