INDEX
Explanations
proper nouns, specifically names related to people
references to the name "ATT" and its variations
New Auto-Interp
Negative Logits
Galile
-0.78
Britann
-0.75
Commando
-0.73
Gork
-0.67
Newfoundland
-0.67
Colombian
-0.65
Bosnia
-0.64
Uzbek
-0.64
methyl
-0.63
Bind
-0.62
POSITIVE LOGITS
anooga
1.44
ributed
1.33
ributes
1.30
leground
1.17
itude
1.17
ainment
1.15
achment
1.13
idy
1.11
orney
1.11
alion
1.09
Activations Density 0.012%