<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
> <channel><title>simon.net.nz &#187; linguistics</title> <atom:link href="http://simon.net.nz/articles/category/linguistics/feed/" rel="self" type="application/rss+xml" /><link>http://simon.net.nz</link> <description>Dr. Simon J. Greenhill&#039;s website</description> <lastBuildDate>Mon, 26 Mar 2012 23:59:07 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=</generator> <item><title>Universal typological dependencies should be detectable in the history of language families</title><link>http://simon.net.nz/articles/universal-typological-dependencies-should-be-detectable-in-the-history-of-language-families/</link> <comments>http://simon.net.nz/articles/universal-typological-dependencies-should-be-detectable-in-the-history-of-language-families/#comments</comments> <pubDate>Tue, 11 Oct 2011 22:53:43 +0000</pubDate> <dc:creator>simon</dc:creator> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=217</guid> <description><![CDATA[Levinson SC, Greenhill SJ, Gray RD, &#038; Dunn M. (2011) Universal typological dependencies should be detectable in the history of language families. Linguistic Typology, 15: 509-534.]]></description> <content:encoded><![CDATA[<blockquote><p> We claim that making sense of the typological diversity of languages demands a historical/evolutionary approach. We are pleased that the target paper (Dunn et al. 2011a) has served to bring discussion of this claim into prominence, and are grateful that leading typologists have taken the time to respond (commentaries denoted by boldface). It is unfortunate though that a number of the commentaries in this special issue show significant misunderstandings of our paper.<br
/> &#8230;<br
/> In the following section we try to explain the basic underlying reasoning, turning in the remaining sections to some of these recurrent points of contention. In the final section, we collect some responses to points made in individual commentaries that did not fit neatly into the body of our response.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/universal-typological-dependencies-should-be-detectable-in-the-history-of-language-families/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>POLLEX-Online: The Polynesian Lexicon Project Online</title><link>http://simon.net.nz/articles/pollex-online-the-polynesian-lexicon-project-online/</link> <comments>http://simon.net.nz/articles/pollex-online-the-polynesian-lexicon-project-online/#comments</comments> <pubDate>Fri, 15 Jul 2011 22:28:14 +0000</pubDate> <dc:creator>simon</dc:creator> <category><![CDATA[austronesian]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=207</guid> <description><![CDATA[Greenhill SJ &#038; Clark R (2011). POLLEX-Online: The Polynesian Lexicon Project Online. Oceanic Linguistics, 50(2), 551-559]]></description> <content:encoded><![CDATA[<blockquote><p> The Polynesian lexicon project, POLLEX, was initiated in 1965 by Bruce Biggs in order to provide a large-scale comparative dictionary of Polynesian languages. Since then, POLLEX has grown to include over 55,000 reflexes of more than 4,700 reconstructed forms in 68 languages. These data have enabled many fundamental advances in Polynesian linguistics and prehistory. At almost half a century old, POLLEX is one of the longest-standing databases of linguistic information, and has moved through various incarnations, from type- writer and edge-punched cards, through microfiche to mainframe computer. In the last few years, online databases of linguistic information have become increasingly more prevalent, representing a major shift in the way linguistics is conducted. Online databases provide many advantages over the older forms of data distribution, including high availability, more robust data storage, and easy data manipulation and searching, and they also facilitate the replication of previous studies. This paper announces the latest reincarnation of the POLLEX database as an online resource, POLLEX-Online (<a
href="http://pollex.org.nz">http://pollex.org.nz</a>), and describes the technical implementation details.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/pollex-online-the-polynesian-lexicon-project-online/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Levenshtein distances fail to identify language relationships accurately</title><link>http://simon.net.nz/articles/levenshtein-distances-fail-to-identify-language-relationships-accurately/</link> <comments>http://simon.net.nz/articles/levenshtein-distances-fail-to-identify-language-relationships-accurately/#comments</comments> <pubDate>Fri, 15 Jul 2011 22:26:31 +0000</pubDate> <dc:creator>simon</dc:creator> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=205</guid> <description><![CDATA[Greenhill SJ (2011). Levenshtein distances fail to identify language relationships accurately. Computational Linguistics, 37(4): 689-698.]]></description> <content:encoded><![CDATA[<blockquote><p> The Levenshtein distance is a simple distance metric derived from the number of edit operations needed to transform one string into another. This metric has received recent attention as a means of automatically classifying languages into genealogical subgroups. In this paper I test the performance of the Levenshtein distance for classifying languages by subsampling three language subsets from a large database of Austronesian languages. Comparing the classification proposed by the Levenshtein distance to that of the comparative method shows that the Levenshtein classification is correct only 40% of time. Standardising the orthography increases the performance, but only to a maximum of 65% accuracy within language subgroups. The accuracy of the Levenshtein classification decreases rapidly with phylogenetic distance, failing to discriminate homology and chance similarity across distantly related languages. This poor performance suggests the need for more linguistically nuanced methods for automated language classification tasks.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/levenshtein-distances-fail-to-identify-language-relationships-accurately/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Evolved structure of language shows lineage-specific trends in word-order universals</title><link>http://simon.net.nz/articles/evolved-structure-of-language-shows-lineage-specific-trends-in-word-order-universals/</link> <comments>http://simon.net.nz/articles/evolved-structure-of-language-shows-lineage-specific-trends-in-word-order-universals/#comments</comments> <pubDate>Wed, 13 Apr 2011 19:15:58 +0000</pubDate> <dc:creator>simon</dc:creator> <category><![CDATA[linguistics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=190</guid> <description><![CDATA[Dunn M, Greenhill SJ, Levinson SC, &#038; Gray RD. 2011. Evolved structure of language shows lineage-specific trends in word-order universals. Nature. 473, 79–82. doi:10.1038/nature09923]]></description> <content:encoded><![CDATA[<p>Some colleagues and I have a new paper out in <i>Nature</i> showing that the evolved structure of language shows lineage-specific trends in word-order universals. I&#8217;ve written an overview/FAQ on <a
href="http://language.psy.auckland.ac.nz/wordorder/">this paper here</a>, and there&#8217;s a nice review of it <a
href="http://www.nature.com/news/2011/110413/full/news.2011.231.html">here</a> and <a
href="http://www.nature.com/nature/journal/v472/n7342/full/472136a.html">here</a>.</p><p>The Abstract:</p><blockquote><p> Languages vary widely but not without limit. The central goal of linguistics is to describe the diversity of human languages and explain the constraints on that diversity. Generative linguists following Chomsky have claimed that linguistic diversity must be constrained by innate parameters that are set as a child learns a language. In contrast, other linguists following Greenberg have claimed that there are statistical tendencies for co-occurrence of traits reflecting universal systems biases, rather than absolute constraints or parametric variation. Here we use computational phylogenetic methods to address the nature of constraints on linguistic diversity in an evolutionary framework. First, contrary to the generative account of parameter setting, we show that the evolution of only a few word-order features of languages are strongly correlated. Second, contrary to the Greenbergian generalizations, we show that most observed functional dependencies between traits are lineage-specific rather than universal tendencies. These findings support the view that—at least with respect to word order—cultural evolution is the primary factor that determines linguistic structure, with the current state of a linguistic system shaping and constraining future states.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/evolved-structure-of-language-shows-lineage-specific-trends-in-word-order-universals/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>On the shape and fabric of human history</title><link>http://simon.net.nz/articles/on-the-shape-and-fabric-of-human-history/</link> <comments>http://simon.net.nz/articles/on-the-shape-and-fabric-of-human-history/#comments</comments> <pubDate>Wed, 01 Sep 2010 06:40:11 +0000</pubDate> <dc:creator>simon</dc:creator> <category><![CDATA[cultural evolution]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=161</guid> <description><![CDATA[Gray RD, Bryant D, &#038; Greenhill SJ (2010) On the shape and fabric of human history. Philosophical Transactions of the Royal Society, B, 365:3923-3933]]></description> <content:encoded><![CDATA[<blockquote><p> In this paper we outline two debates about the nature of human cultural history. The first focuses on the extent to which human history is treelike (its shape), and the second on the unity of that history (its fabric).</p><p>Proponents of cultural phylogenetics are often accused of assuming that human history has been both highly tree-like and consists of tightly linked lineages. Critics have pointed out obvious exceptions to these assumptions.</p><p>Instead of a priori dichotomous disputes about the validity of cultural phylogenetics phylogenies, we suggest that the debate is better conceptualized as involving positions along continuous dimensions. The challenge for empirical research is therefore to determine where particular aspects of culture lie on these dimensions.</p><p>We discuss the ability of current computational methods derived from evolutionary biology to address these questions. These methods are then used to compare the extent to which lexical evolution is treelike in different parts of the world and to evaluate the coherence of cultural and linguistic lineages.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/on-the-shape-and-fabric-of-human-history/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>The shape and tempo of language evolution</title><link>http://simon.net.nz/articles/the-shape-and-tempo-of-language-evolution/</link> <comments>http://simon.net.nz/articles/the-shape-and-tempo-of-language-evolution/#comments</comments> <pubDate>Wed, 07 Apr 2010 07:51:13 +0000</pubDate> <dc:creator>Simon</dc:creator> <category><![CDATA[cultural evolution]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <category><![CDATA[research]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=149</guid> <description><![CDATA[Greenhill SJ, Atkinson QD, Meade A, &#038; Gray RD. (2010) The shape and tempo of language evolution. Proceedings of the Royal Society, B, 277:2443-2450.]]></description> <content:encoded><![CDATA[</blockquote><p>There are approximately 7000 languages spoken in the world today. This diversity reflects the legacy of thousands of years of cultural evolution. How far back we can trace this history depends largely on the rate at which the different components of language evolve. Rates of lexical evolution are widely thought to impose an upper limit of 6-10 thousand years on reliably identifying language relationships. In contrast, it has been argued that certain structural elements of language are much more stable. Just as biologists use highly conserved genes to uncover the deepest branches in the tree of life, highly stable linguistic features hold the promise of identifying deep relationships between the world’s languages.<br
/> Here we present the first global network of languages based on this typological information. We evaluate the relative evolutionary rates of both typological and lexical features in the Austronesian and Indo-European language families. The first indications are that typological features evolve at similar rates to basic vocabulary but their evolution is substantially less treelike. Our results suggest that, whilst rates of vocabulary change are correlated between the two language families, the rates of evolution of typological features and structural sub-types show no consistent relationship across families.</p></blockquote><p><a
href='http://simon.net.nz/files/2010/04/Greenhill_et_al2010-preprint.pdf'>You can download a preprint of this paper here.</a></p><blockquote><div
id="attachment_151" class="wp-caption alignnone" style="width: 610px"><a
href="http://simon.net.nz/files/2010/04/Greenhill_et_al2009-fig1x-small.png"><img
class="size-full wp-image-151" title="Greenhill_et_al2009-fig1x-small" src="http://simon.net.nz/files/2010/04/Greenhill_et_al2009-fig1x-small.png" alt="Network of 99 languages" width="600" height="817" /></a><p
class="wp-caption-text">A network showing the relationships between 99 languages across the world. This network is built from structural information about language properties (phonology, morphology, grammar and lexicon). The length of the branches is proportional to the amount of divergence between the languages, and the box-like structures on the network show conflicting signal due to borrowing between languages and chance similarity.</p></div></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/the-shape-and-tempo-of-language-evolution/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>COOL8: The probability of proto-forms</title><link>http://simon.net.nz/articles/cool8-the-probability-of-proto-forms/</link> <comments>http://simon.net.nz/articles/cool8-the-probability-of-proto-forms/#comments</comments> <pubDate>Thu, 21 Jan 2010 20:14:21 +0000</pubDate> <dc:creator>Simon</dc:creator> <category><![CDATA[conferences]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[talks]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=127</guid> <description><![CDATA[Last week I presented a talk to the 8th International Conference on Oceanic Linguistics on the probability of protoforms: Many papers about language subgrouping make the argument that if a word form is present in two language subgroups, then it probably reflects their common proto-language. This fairly reasonable assumption has taken center-stage in the reconstruction [...]]]></description> <content:encoded><![CDATA[<p>Last week I presented a talk to the <a
href="http://http://www.confer.co.nz/cool8/programme.html">8th International Conference on Oceanic Linguistics</a> on <i>the probability of protoforms</i>:</p><blockquote><p>Many papers about language subgrouping make the argument that if a word form is present in two language subgroups, then it probably reflects their common proto-language. This fairly reasonable assumption has taken center-stage in the reconstruction of Proto-Central-Pacific and Proto-Oceanic, and subsequent inferences about these respective societies. In this talk I will introduce a method that calculates the actual probability that a given form reflects the proto-language. This method builds on the linguistic comparative method, and provides a powerful tool for testing and confirming reconstructions in a probabilistic framework.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/cool8-the-probability-of-proto-forms/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Austronesian language phylogenies: Myths and misconceptions about Bayesian computational methods</title><link>http://simon.net.nz/articles/austronesian-language-phylogenies-myths-and-misconceptions-about-bayesian-computational-methods/</link> <comments>http://simon.net.nz/articles/austronesian-language-phylogenies-myths-and-misconceptions-about-bayesian-computational-methods/#comments</comments> <pubDate>Fri, 17 Jul 2009 01:07:26 +0000</pubDate> <dc:creator>Simon</dc:creator> <category><![CDATA[austronesian]]></category> <category><![CDATA[cultural evolution]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <category><![CDATA[research]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=68</guid> <description><![CDATA[Greenhill SJ &#038; Gray RD (2009) Austronesian language phylogenies: Myths and misconceptions about Bayesian computational methods. In Austronesian historical linguistics and culture history: a festschrift for Robert Blust (Pp 375-397). A. Adelaar &#038; A. Pawley (Eds). Canberra: Pacific Linguistics.]]></description> <content:encoded><![CDATA[<blockquote><p> Historical linguistics has never been particularly intimate with computers. The first wave of computational historical linguistics—lexicostatistics—was developed in the 1950s and quickly applied to language groups around the world from Indo-European to Austronesian. However, critics were quick to point out the problems caused by assuming a single constant rate of lexical replacement and repeatedly noted the erroneous results that this produced. As a consequence of these critiques lexicostatistics has been widely rejected by mainstream historical linguists. The last few years have seen a second wave of computational approaches entering historical linguistics: phylogenetic methods. These techniques, drawn from evolutionary biology, have been used to investigate some provocative and controversial claims about human prehistory. Given the combination of strong claims, new techniques, and the high-profile reporting of results, it is not surprising that these studies are often controversial. Sadly many of these criticisms are mired in misunderstanding.</p><p>Computational phylogenetic methods are not just lexicostatistics redux, but a powerful supplement to the comparative method used in historical linguistics. Here we will focus on one of the great battlegrounds between lexicostatistics and the traditional comparative method: the Austronesian language family. First, we will describe how Bayesian phylogenetic methods work, and then give a step-by-step explanation of an analysis of a large lexical dataset for 400 Austronesian languages.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/austronesian-language-phylogenies-myths-and-misconceptions-about-bayesian-computational-methods/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Does horizontal transmission invalidate cultural phylogenies?</title><link>http://simon.net.nz/articles/does-horizontal-transmission-invalidate-cultural-phylogenies/</link> <comments>http://simon.net.nz/articles/does-horizontal-transmission-invalidate-cultural-phylogenies/#comments</comments> <pubDate>Wed, 18 Mar 2009 21:50:21 +0000</pubDate> <dc:creator>Simon</dc:creator> <category><![CDATA[cultural evolution]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[phylogenetics]]></category> <category><![CDATA[publications]]></category> <category><![CDATA[research]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=61</guid> <description><![CDATA[Greenhill SJ, Currie TE, &#038; Gray RD (2009)
Does horizontal transmission invalidate cultural phylogenies? Proceedings of the Royal Society B. 276: 2299-2306.]]></description> <content:encoded><![CDATA[<blockquote><p> Phylogenetic methods have recently been applied to studies of cultural evolution. However, it has been claimed that the large amount of horizontal transmission that sometimes occurs between cultural groups invalidates the use of these methods. Here, we use a natural model of linguistic evolution to simulate borrowing between languages. The results show that tree topologies constructed with Bayesian phylogenetic methods are robust to realistic levels of borrowing. Inferences about divergence dates are slightly less robust and show a tendency to underestimate dates. Our results demonstrate that realistic levels of reticulation between cultures do not invalidate a phylogenetic approach to cultural and linguistic evolution.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/does-horizontal-transmission-invalidate-cultural-phylogenies/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics</title><link>http://simon.net.nz/articles/the-austronesian-basic-vocabulary-database-from-bioinformatics-to-lexomics/</link> <comments>http://simon.net.nz/articles/the-austronesian-basic-vocabulary-database-from-bioinformatics-to-lexomics/#comments</comments> <pubDate>Mon, 03 Nov 2008 04:21:23 +0000</pubDate> <dc:creator>Simon</dc:creator> <category><![CDATA[austronesian]]></category> <category><![CDATA[cultural evolution]]></category> <category><![CDATA[linguistics]]></category> <category><![CDATA[publications]]></category> <guid
isPermaLink="false">http://simon.net.nz/?p=49</guid> <description><![CDATA[Greenhill SJ, Blust R, &#038; Gray RD (2008) The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics. Evolutionary Bioinformatics, 4:271-283.]]></description> <content:encoded><![CDATA[<blockquote><p>Phylogenetic methods have revolutionised evolutionary biology and have recently been applied to studies of linguistic and cultural evolution. However, the basic comparative data on the languages of the world required for these analyses is often widely dispersed in hard to obtain sources. Here we outline how our Austronesian Basic Vocabulary Database (ABVD) helps remedy this situation by collating wordlists from over 500 languages into one web-accessible database. We describe the technology underlying the ABVD and discuss the benefits that an evolutionary bioinformatic approach can provide. These include facilitating computational comparative linguistic research, answering questions about human prehistory, enabling syntheses with genetic data, and safe-guarding fragile linguistic information.</p></blockquote> ]]></content:encoded> <wfw:commentRss>http://simon.net.nz/articles/the-austronesian-basic-vocabulary-database-from-bioinformatics-to-lexomics/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
