INDEX
Explanations
phrases related to human existence or characteristics
references to the concept of being human
New Auto-Interp
Negative Logits
Els
-0.73
Mines
-0.71
NRS
-0.70
éĹĺ
-0.70
Passage
-0.64
ories
-0.63
redundancy
-0.62
ICE
-0.62
yss
-0.62
Deadly
-0.61
POSITIVE LOGITS
alive
0.91
hood
0.81
who
0.81
lived
0.81
judged
0.79
inhab
0.78
born
0.76
endowed
0.75
atos
0.74
beings
0.74
Activations Density 0.030%