Explanations
mentions of the word “human”
references to the human condition and human experiences
Negative Logits
Token         Logit
Transcript    -0.87
abb           -0.77
forth         -0.74
Reloaded      -0.70
INO           -0.69
REP           -0.68
arella        -0.67
Christensen   -0.66
OHN           -0.65
armac         -0.65
Positive Logits
Token         Logit
beings        1.22
readable      0.97
rights        0.94
istic         0.93
ized          0.88
itar          0.88
rights        0.83
fingert       0.83
embryonic     0.83
izes          0.81
Activations Density 0.024%
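The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero; the function name, the threshold, and the sample data below are hypothetical and chosen only so that the result matches the figure shown.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 3 of 12,500 tokens.
sample = [0.0] * 12_497 + [1.3, 0.4, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.024%
```

Under this reading, a density of 0.024% would mean the feature is sparse, firing on roughly one token in every four thousand.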