Showing posts with label articulation. Show all posts
Showing posts with label articulation. Show all posts

11.09.2007

what's in a whole nother error?

so, in my last post, i discussed the two definitions of error: grammatical or usage error ('spicket' for 'spigot') and technical speech errors ('cattleships and bruisers' for 'battleships and cruisers'). i'd like to address another, somewhat related, error issue that sprung from my thesis project on typos.

speech errors follow very predictable patterns. they are totally constrained by certain rules such as 'vowels only slip with vowels' and 'speech errors only produce valid syllables.' that is, 'bad' would never slip and become 'abd' because this involves a consonant/vowel swap and produces a sound combination, 'bd,' that doesn't occur in english.

clearly, typos don't obey these rules. the common typo the-->hte is a good example. 'ht' is not a valid word beginning, and yet this typo occurs all the time. most of the typos that violate the rules of speech errors have pretty clear motivations. 'hte' involves switching two adjacent letters. 'tghis' involves accidentally hitting a key next to the intended one. but in my thesis, i found a lot of typos that seem unmotivated by physical factors of the keyboard. for instance, someone substituted a 'b' for a 'p' and one typed 'soom bo-' when they meant to type 'some books.'

similarity of units has always been acknowledged as a factor in speech errors. each speech sound has a set of features (place of articulation in the mouth, manner of articulation, and whether or not the vocal chords are vibrating), and the more features two sounds share, the more likely they are to slip with one another. 'b' for 'p' makes sense as a speech error because the two sounds share every feature except voicing. but as a typo, it's hard to understand because the two letters are so far apart on the keyboard.

so it seems that there are some typos that obey the speech error constraints and are based on phonetic similarity. but some are based only on motoric or articulatory similarity. this has a couple of implications, i believe.

first, it seems to indicate that we do use inner speech when we type. the sounds of language play a role in the way we plan our typing, which doesn't involve sound in any other way. second, it means that we're able to separate articulatory features from phonetic features. this makes the comparison to speech kind of tricky, because in speech, the notions of articulatory and phonological similarity are the same. that is, phonological similarity is based on the physical characteristics of a particular medium--the human mouth we all share. it's as much an historical accident as they layout of the keyboard!

AND i have a completely unfounded theory! another thing happens in speech because of articulatory factors: transformations in quick speech. these are things like 'did you eat?' becoming 'jeet?' linguists don't consider these errors, in part because they're very predictable. so what if these kinds of abbreviations--which, remember, occur because of fast articulation--were analogous to articulatory errors (those caused by the layout of the keyboard) in typing? these kind of errors are, almost certainly, more common in fast typing and, if collapses aren't counted as errors, typing errors vastly outnumber speech errors. i'm not saying the numbers shold match perfectly, but the difference in frequency i found was really huge, and when i compared speech errors only to speech-error like typos, the numbers were quite similar. (sorry i'm being too lazy to actually haul out my thesis and give you the exact numbers. i'll send you a copy if you want one.)

i think the only real difference between articlatory laziness in speech and typing errors is that the former is completely expected. it's only in recent years that typing has become a real-time communications medium, and i would guess that we'll start to see more of these typos going uncorrected. i've certainly noticed that i tend to not correct them in informal im conversations, especially if they're fairly predictable and don't disrupt interpretation, like how-->hwo.

it's also interesting to note that almost all the definitions of error that we've encountered involve accidental violation of the speaker's intention. and these articulatory foibles certainly fit that bill. few people would think they meant to say 'jew eat?' or 'jeet?' rather than 'did you eat?' but we all do it the same way we all occasionally, predictably type 'teh.'

4.12.2007

thethesisis full of errors!

greetings, gentle readers! i apologize, once again, for the long hiatus. this post will give a brief preview of the fruits of the labor that has kept me from blogging ere these long weeks. my thesis about typos is nearing completion, and, believe it or not, i actually have some interesting conclusions to show for it! what follows is a synopsis of what my thesis will become over the next couple of weeks... *knocks on wood*.

many of the studies of typos i read are inadequate to describe naturalistic typing (that is, speech over IM, for instance), because they test people doing transcription typing, or they gather their corpora (sets of linguistic material--in this case, errors) from published works. neither of these data sets is appropriate for comparing the errors made in typing to the errors made in speaking; published typos are very selective because they are only those that escaped the notice of several proof-readers and editors. transcription studies don't work because they involve a perceptual element, reading, that a) doesn't occur in speech and b) might change the types of errors made in ways that are, as yet, unpredictable.

so i brought 20 undergrads into the lab and made them (paid them to) have IM conversations. they used software that recorded each of their keystrokes including the backspace key. These 10 conversations generated almost 350 typos, including things that were clearly 'edits' (planned utterances that were deleted before being sent). My spreadsheet called 'Typos that Matter' contains 192 rows. These are typos that involve individual letters or sounds. They will be the focus of my study!

one of the main observations made about speech errors is their striking regularity. for example, in speech, consonants only slip with other consonants and vowels only slip with other vowels. Gary Dell, one of the big speech error guys, uses a set of four rules, including this consonant-vowel category effect, to identify 'human-like' speech errors. Other rules include the phonotactic regularity effect--speech errors do not cause combinations of letters that are illegal in the language being spoken. For example, in English, you can't begin a word with 'tk,' so that combination never occurs at the beginning of a word as a result of an error.

it's immediately apparent that typos do not obey these same regularities. adjacent letters often slip, regardless of whether they are consonants or vowels, and illegal combinations occur fairly frequently.

another important difference concerns the volume of errors in speech versus typing. I brought half as many participants into the lab to have conversations over the phone. they produced only 5 Errors that Matter. the ratio of speech to typing errors in this setting is something on the order of 1 to 20.

both of the above facts are evidence that typos occur because of very different processes than speech errors do. however, the fact that made me want to study this in the first place is the similarity of some typos to speech errors. there is a set of typos that behave almost the same way as speech errors--these errors are of a specific type known as 'non-contextual substitutions.' in these cases, a letter is removed completely and replaced with another letter that does not exist in the surrounding utterance. "fright" typed for "bright" is a good example of this, whereas "rbight" for "bright" is a contextual exchange.

when i examined only single letter non-contextual substitution errors, a striking pattern emerged: they follow Dell's rules very very closely, almost entirely, as speech errors do. except one rule to which i'll return later.

since one class of errors follows cognitive constraints while others seem to ignore them completely, i plan to propose that there are two separate mechanisms responsible for typos. non-contextual substitutions (and perhaps other types of errors) are caused by errors in planning and internal speech--the same things that cause spoken errors. exchanges and other types of errors occur after planning at a lower level of processing and aren't susceptible to the filters or structures that make phonotactic and consonant-vowel category violations impossible. that's pretty neat!

the Dell rule that non-contextual substitutions do violate with abandon is known as the initialness effect. in speech errors involving consonants, the first consonant in a word is much more likely to be involved in an error than consonants at the end or middle of words are. the same is not true for typing errors! in fact, of the 22 non-contextual substitutions involving consonants that i've checked for consistency with Dell's rules, 16 violate the initialness effect. this is a striking effect that might point to a major difference in processing and planning across the two media.

so, not only are there different articulatory problems with typing than there are with speech, but perhaps the medium in which we do our articulation actually effects the way we plan and retrieve speech. i would try to claim this as empirical evidence that contemporary media theorists are correct--that communication technologies effect our cognition in a profound way--but that's a lot to say in 20 pages, and linguists don't take kindly to much of my favorite media theory...

look for a future post on plans for further research!

8.19.2006

text apparatuses

apparati?

so, even though i haven't done any research yet, i feel pretty comfortable hypothesizing that the layout of the keyboard is one factor in determining how language (text, in this case) is produced. it not only structures how we're able to create text on a computer, it has also irrevocably changed how language is changing. that is, in some circles 'teh' is a word with its own meaning. '!!!11oneeleven!!' is an expression of excitement. 'book' can mean 'cool' because they're frequently confused words in the text messaging system T-9.

the keyboard is a technology. it is a tool that helps humans accomplish a task; it extends the body in a certain way. it intervenes from without and acts upon language. but i would argue that it is also already a part of language. a historical linguist reflecting on this period of development 1000 years from now could not understand the process of language change that English (and probably all languages, to greater or lesser degrees) is undergoing wihout understanding the technology of the keyboard.

linguists have to study the configuration of the human mouth, lungs, and vocal cords in order to understand the way languages change over time and why people make the mistakes they do in processing it. is the mouth also an apparatus of language? does it intervene from without? is it the original extension of the body? is it an extension of the mind (from which language flows directly)?

is the brain an apparatus? is the way it's configured responsible for how language is configured and used? i think most linguists would say yes. how could we ever study language separately from the apparatuses used to produce it? but the brain is viewed as something integral to and inseparable from language use.

whereas the keyboard is not.

we could switch keyboards any time.

but we don't. we can't, actually. there's a better keyboard design ( the Dvorak simplified keyboard) out there. ndividual typists are perfectly capable of learning a new system quickly and effectively. the traditional QWERTY keyboard layout was intentionally designed to slow typists down back when hitting keys too fast would cause typewriters to jam. if the purpose of language were fast communication, and everyone worked toward the end of making communication efficient, we would all go buy Dvorak keyboards tomorrow and start producing different typos.

but everyone doesn't work toward the end of making communication efficient. everyone works toward the most conventional end. Convention (as defined by David Lewis in his book of the same name) is a continuation of the same behavior on the expectation that everyone else will continue the same behavior (or a complementary behavior like understanding you when you speak). how the behavior gets started in the first place doesn't actually matter. it can be established by precendent, by analogy to another situation, by fictional or second-hand analogy to a similar situation, or so on. we try to make language as useful as we can without rocking the boat too much. each individual could switch keyboards or create a private language or make up slang, but for a change to be recognized as part of the language it has to occur on a much wider scale. in fact, it's probably most likely to occur on a wide scale if it's caused by the apparatuses we all use (more or less the same way) to create language.

have i gone in a circle yet with this? just because we CHOOSE a technology for some arbitrary reason doesn't make it any less integral to language than the structures of the mouth or the brain. how a convention begins doesn't matter. if we all keep using it, it will continue to work, and if it keeps working we will continue to use it. there. there's the circle.

i think what i'd like to get at is that all parts of language (phonology, syntax, semantics, etc.) are fundamentally shaped and changed by these processes which are not logical or rational but conventional. what are the odds that structures that formed conventionally over thousands of years and accidents can be re-written in binary or reduced to logic? slim to none, i'd think. why do we formulate 'if then' statements for phonological rules and binary trees to explain syntactic structures?

maybe those are just the conventions we're born into.