INDEX
Explanations
terms and references related to civil rights
New Auto-Interp
Negative Logits
oon
-0.17
lem
-0.15
loy
-0.15
oops
-0.15
emes
-0.15
imate
-0.14
Stranger
-0.14
aves
-0.14
emo
-0.14
ino
-0.14
POSITIVE LOGITS
ucu
0.15
acket
0.15
İ·
0.14
shutter
0.14
altar
0.14
zdy
0.14
ãĥ¢ãĥ³
0.14
bote
0.13
jez
0.13
rams
0.13
Activations Density 0.012%