4

Possible Duplicate:
Is “data” considered singular or plural?

Milton Friedman, the Nobel-prize winning economist used to threaten that he would "take away any graduate student's Ph.D. if they used the word data as a singular."

Proper usage of the word data is in the plural:

  • The data are very hard to gather
  • The findings follow directly from the data. They unambiguous support our hypotheses

I'm wondering though how you refer to data in the following sentence, "The analysis using the data should go quickly since I am familiar with them (??)"

I'm not sure how to correctly refer to data in the above sentence. Most people (including myself), find it hard to use the first two examples correctly. This last example seems even trickier.

Does anyone have advice on the right way to refer to data in the above sentence?

  • 6
    Well this Milton Friedman dude has obviously never heard of *mass nouns*. It's not a singular, it just looks like one. "A data" would be singular and no Ph.D. student would ever say that. – hippietrail Dec 03 '11 at 11:22
  • @hippietrail: I must object. Many style guides recommend treating data as plural (after all it is the Latin neuter plural). And, no, I didn't violate that rule here: it is "[the word] data was". The singular is the now rarely used datum. – Cerberus - Reinstate Monica Dec 03 '11 at 12:35
  • 1
    I'm not picking on you Cerberus. But if this is the exact wording of what Friedman said then he's either being disingenuous or ignorant or the worst kind of maven trying to blackmail people into keeping his favourite archaism alive. "The data" is ambiguous in that it works fine no matter whether you think "data" is a plural or a mass noun. It takes a verb or pronoun to differentiate. But mass nouns also take singular verbs/pronouns. Saying a mass noun is a singular noun is just plain wrong since only countable nouns can be singular. – hippietrail Dec 03 '11 at 12:46
  • 1
    The unambiguous grammatical mistake in the above sentences is using unambiguous as an adverb; it should be unambiguously. – Peter Shor Dec 03 '11 at 14:07
  • I'm basically with @hippietrail on this one, but it really is about the most tedious hoary old chestnut on ELU. I vote we specifically debar questions about plurality of "data" in the FAQ. – FumbleFingers Dec 03 '11 at 14:42
  • @Hippie: All right, but I disagree that only countable nouns should be called singular. That sounds like a very controversial position: I wonder what your arguments are. Whenever a word can be both countable and uncountable, it can be clearly observed that the singular form of the countable noun is identical to the uncountable form: I like beer — I'd like a beer, please — I'd like two beers, please. Note also that we say the beer is cold, not *the beer are cold. – Cerberus - Reinstate Monica Dec 06 '11 at 03:30
  • @Cerberus: It is exactly this ambiguity that makes it important to distinguish between "singular" and "singular form". Many people are confused by this so it's always good to help clear up the differences, and anyone consciously blurring the two on purpose when ranting to people about correctness are not helping anybody. (Don't worry I'm not calling you such a ranter but there are plenty out there and if Friedman is quoted here accurately he appears to be one.) – hippietrail Dec 06 '11 at 09:01
  • @hippietrail: Eh I have never heard of this dinstinction? I call every noun singular that both agrees with a singular verb and doesn't have a plural ending (s). Every noun that has a plural ending and agrees with plural verbs (in a specific sense) I call plural. Whenever a noun has one property plural but the other singular, there is room for discussion. But I don't see room for two different concepts of plurality here. It is only a syntactic feature, nothing deeper. – Cerberus - Reinstate Monica Dec 06 '11 at 14:23
  • 4
    @Cerberus: Well your'e not alone but you're not huffing and puffing and threatening to to fail Ph.D. students over it so it doesn't matter. But this Friedman should know better and of course we that endeavour to improve a Q&A site about language usage can only benefit from knowing how to keep these clear and explain them to seekers of knowledge (-: Native speakers usually don't know about noun countability until they study linguistics, learn some other language, try to teach English, or if they dabble in some computerised language processing. Non native speakers will come up against it though. – hippietrail Dec 06 '11 at 15:46
  • tchrist's answer to this subsequent question is also of great interest. – StoneyB on hiatus Sep 16 '12 at 19:17
  • I did some digging and here is what I found: https://medium.com/on-data-and-language/356332cdb81 – philshem Mar 28 '14 at 07:43

3 Answers3

7

This is an English question rather than statistics. Many Latin and Greek plurals are often used as singulars in English (agenda is an obvious example) and the word data is used both ways, as indeed is statistics (though with slightly different meanings).

If you want to keep your sanity and your PhD you should say something like:

The analysis should go quickly since I am familiar with the data.

Henry
  • 20,183
6

The article on data in ‘The Cambridge Guide to English Usage’ distinguishes between These data were gathered by intensive interviewing. They show . . . and This data was gathered by intensive interviewing. It shows . . . As the article points out

This second version actually expresses something slightly different from the first: it projects the data as a mass or block rather than a set of separable items. Data thereby becomes a mass noun.

You can indeed dodge the issue by changing the sentence altogether as Henry suggests. If, on the other hand, you want to confront it head on you have to decide on how you view data. It’s difficult to pronounce on the matter without knowing the context, but you should have no hesitation in using the pronoun it rather than them if you conclude from the preceding sentences of your text that data is to be seen, to repeat the words of the article, ‘as a mass or block rather than a set of separable items’.

Barrie England
  • 140,205
2

Agenda has passed the threshold to singularity for nearly everyone; but, as you say, many people still adhere to data plural. If you decide to go with traditional style, treat it as plural everywhere. That includes personal pronouns:

The analysis using the data should go quickly, since I am familiar with them.