INDEX
Explanations
the word "so" for the purpose of signaling reasoning or consequence
conjunctions that imply causation or conclusion
New Auto-Interp
Negative Logits
mast
-0.64
ammy
-0.64
inch
-0.61
女
-0.56
breast
-0.55
sha
-0.55
ICT
-0.54
Mens
-0.54
ixie
-0.54
thro
-0.54
POSITIVE LOGITS
bered
1.22
othe
1.13
oner
1.11
apy
1.00
othes
0.96
FTWARE
0.88
aps
0.87
ooo
0.82
oooo
0.82
iled
0.80
Activations Density 0.068%