INDEX
Explanations
references to specific authors and their works
New Auto-Interp
Negative Logits
VX
-0.15
ÑģÑĤвом
-0.14
uxt
-0.14
vez
-0.14
Trotsky
-0.14
iona
-0.14
à¥Īल
-0.14
_IOC
-0.14
Nude
-0.14
hoff
-0.14
POSITIVE LOGITS
Stephen
0.35
Penny
0.34
IT
0.33
King
0.30
Stephen
0.28
IT
0.28
Maine
0.27
Bill
0.27
_IT
0.25
King
0.24
Activations Density 0.008%