INDEX
Explanations
the word "but" followed by a description or contrast
instances of contrast or exception
New Auto-Interp
Negative Logits
lement
-0.73
ealing
-0.66
ammed
-0.65
esc
-0.65
ISH
-0.63
à¥
-0.63
aez
-0.63
legate
-0.62
adr
-0.62
IZ
-0.62
POSITIVE LOGITS
none
1.35
nothing
1.14
mostly
1.10
generally
1.01
invariably
1.00
ultimately
1.00
alas
1.00
nowhere
0.99
overall
0.99
chiefly
0.97
Activations Density 0.239%