INDEX
Explanations
names of places or people
specific initials or acronyms associated with locations or events
New Auto-Interp
Negative Logits
idium
-1.09
Domin
-0.82
ãĥĩãĤ£
-0.78
ira
-0.77
inator
-0.77
Isa
-0.75
JD
-0.73
Ñ
-0.72
ila
-0.72
ipel
-0.71
POSITIVE LOGITS
web
0.85
web
0.73
gow
0.73
ews
0.73
Chow
0.71
watch
0.71
Greens
0.71
10
0.70
Care
0.69
Work
0.68
Activations Density 0.351%