0
MrPedantic Posted 19 years ago
Linguistics Studies

Pros & Cons of Using Google Totals

Continuing from here:


BokehAll those occure on both sides of the equation. In maths/statistics when that happens such effects are considered cancelled.
How do you know they occur equally on both sides, out of interest?

MrP
  

Top answer

Oh, come on MrP, there was no need to open a thread! Well, if you want and you're interested in discussing this... I guess it's ok.

  • Oh, come on MrP, there was no need to open a thread!
  • Well, if you want and you're interested in discussing this...
  • I guess it's ok.
  • It's just that the title seems a little strange to me: Pros and cons of using Google totals.
  • What do you mean by totals?
Free · every Monday

Get the Weekly English Kit 📬

New words, one handy idiom, and a 2-minute quiz — delivered to your inbox to keep your streak alive.

11 Answers
0
Oh, come on MrP, there was no need to open a thread! Emotion: wink Well, if you want and you're interested in discussing this... I guess it's ok.
0
You don't. We are not talking real maths here, just probability. What you do know is the same algorithm is applied to both sides of the equation/search. That means both sides are subject to the juxtapositions and disregard for punctuation and case, etc. Obviously certain specific strings may have some amount of bias but used in context as an indicator Google is a useful tool. What else is there t
0
I don't deny that it's useful; but if one string in a comparison has greater potential for ambiguity, as in this case, it's very difficult to see how we can justify a figure as precise as 38%.

For instance, on the first page for "us three are", I find:


1 reference to this thread

1 case where the occurrence is only in a link

5 cases where "us" = "U.S.
0
BokehYou don't. We are not talking real maths here, just probability.
Emotion: surprise Uh? But you are not
0
How do you know they occur equally on both sides, out of interest?
I think Bokeh is saying that it's like throwing dice. You can throw this pair of dice or that pair of dice, but if both pairs are fair dice, the probability distribution for both situations is the same. Even if they're crooked in the same way, the distributions will be the same.

Likewise,
0
Hi,

I do not know much about probability et cetera, but I do think Google povides an unprecedented corpus. The question is whether it serves the purpose, that is, leading the searcher to the intended destination. When I carry out a search about a phrase, I generally doublecheck the reult with NYTimes or bbc.c.o.uk. Sometimes the result is consistent, sometimes not.

-------------
0
LinguaphileI generally doublecheck the reult with NYTimes or bbc.c.o.uk.
That's a fair approach if you are looking for prescriptive grammar or correctness, but not for looking for the frequency of common "errors".
0
CalifJimYou can throw this pair of dice or that pair of dice, but if both pairs are fair dice, the probability distribution for both situations is the same. Even if they're crooked in the same way, the distributions will be the same.

I think with text strings, though, there are factors that we wouldn't find with dice – two strings may not have equal potential f
0
there are factors that we wouldn't find with dice
Well, you're right, of course, but that means you have to take a look at the first couple of pages to see (proportionally, roughly) how many turtles and squid you netted while fishing for shrimp. And, as I mentioned, you need to use some judgment about whether you're searching below the noise level and losing the sig
0
MrPedanticOr to take another example: in the case of "if it were" (14.9m) vs "if it was" (22m), we would not know without checking how many hits for the latter were non-counterfactual (e.g. "Not sure if it was reading that or my hardcore revision for my swiftly approaching politics exam, but yeah, I have a headache"). There isn't an equivalent ambiguity in "if it were", h

Related Questions