INDEX
Explanations
references to the place or title "Oz."
mentions of "Oz" and related references to the Wizard of Oz
New Auto-Interp
Negative Logits
ãģį
-0.67
ãģ¦
-0.66
fries
-0.62
trl
-0.61
RAW
-0.60
士
-0.59
++++++++++++++++
-0.59
delinqu
-0.59
inadequ
-0.58
Í
-0.58
POSITIVE LOGITS
osta
0.93
rh
0.87
leys
0.84
boro
0.83
Oz
0.82
asar
0.80
elia
0.79
ey
0.78
eki
0.77
biz
0.76
Activations Density 0.032%