INDEX
Explanations
references to individuals' professional backgrounds and achievements
New Auto-Interp
Negative Logits
rone
-0.15
CharSet
-0.14
Drops
-0.14
ongyang
-0.13
rypton
-0.13
chl
-0.13
:normal
-0.13
otel
-0.13
ReadWrite
-0.13
kot
-0.13
POSITIVE LOGITS
wife
0.17
natives
0.16
Previous
0.15
Prior
0.15
Wife
0.15
/go
0.15
Owners
0.15
rejo
0.15
enjoys
0.15
obtaining
0.14
Activations Density 0.144%