Explanations
mentions of the word "humanity"
mentions of humanity and its implications
Negative Logits
raph       -0.70
urations   -0.69
skim       -0.66
engers     -0.66
Bey        -0.65
abet       -0.64
FEC        -0.64
OHN        -0.63
INO        -0.61
Rough      -0.61
Positive Logits
ankind       1.00
beings       0.95
zee          0.76
arily        0.74
zees         0.72
ís           0.72
itably       0.71
geist        0.70
inhab        0.69
flourishing  0.69
Activations Density 0.022%