INDEX
Explanations
mentions of or related to humans
occurrences and discussions of humans
Negative Logits
Line           -0.67
rent           -0.67
forth          -0.67
Coun           -0.67
abb            -0.65
PM             -0.64
acc            -0.63
Stud           -0.62
Sutherland     -0.62
Scarborough    -0.61
Positive Logits
beings         1.18
humans         0.97
Humans         0.96
folk           0.96
readable       0.82
omorphic       0.82
oids           0.81
mortals        0.76
izont          0.75
zee            0.72
Activations Density 0.012%