INDEX
Explanations
mentions of specific names or surnames
proper nouns, specifically names of people
New Auto-Interp
Negative Logits
soDeliveryDate
-0.60
Els
-0.59
ãĥ¼ãĥ³
-0.55
corrid
-0.55
comprom
-0.54
subscript
-0.54
horizont
-0.54
mathemat
-0.52
cyt
-0.51
governors
-0.50
POSITIVE LOGITS
Jr
1.04
Sr
0.87
III
0.75
aka
0.69
wine
0.68
velt
0.67
gart
0.66
stadt
0.65
sson
0.65
kamp
0.63
Activations Density 0.467%