INDEX
Explanations
phrases related to comparison or interaction between individuals
references to "other people" or interactions involving them
New Auto-Interp
Negative Logits
Petraeus
-0.67
wark
-0.64
rought
-0.64
olt
-0.64
ukemia
-0.63
Lans
-0.59
Canaver
-0.59
udence
-0.58
ucc
-0.58
ysis
-0.57
POSITIVE LOGITS
worldly
1.84
peoples
1.22
people
1.02
person
1.00
wise
0.95
persons
0.91
world
0.88
kinds
0.88
humans
0.87
countries
0.86
Activations Density 0.097%