INDEX
Explanations
references to the name "Taft."
New Auto-Interp
Negative Logits
oir
-0.85
gc
-0.79
fu
-0.77
bors
-0.73
fw
-0.71
monds
-0.70
roe
-0.69
oku
-0.67
tes
-0.67
mas
-0.67
POSITIVE LOGITS
hess
0.79
xton
0.75
esthesia
0.67
ub
0.64
metic
0.61
existence
0.60
DOS
0.60
htaking
0.59
verty
0.59
ebook
0.59
Activations Density 0.048%