INDEX
Explanations
statements about evaluation or reflection on a given situation or action
New Auto-Interp
Negative Logits
tables
-0.65
races
-0.64
stripe
-0.64
marg
-0.63
fishes
-0.62
Afgh
-0.61
Grac
-0.61
lou
-0.60
LV
-0.59
cuff
-0.58
POSITIVE LOGITS
nevertheless
0.98
etheless
0.96
senal
0.95
nonetheless
0.92
obyl
0.91
likewise
0.90
¿½
0.90
also
0.86
Ĥİ
0.85
ŃĶ
0.80
Activations Density 1.725%