INDEX
Explanations
phrases involving contrasting or contradictory actions or situations
auxiliary verbs indicating states or conditions
New Auto-Interp
Negative Logits
acca
-0.88
kay
-0.80
eah
-0.75
resso
-0.74
oj
-0.70
minecraft
-0.69
rongh
-0.69
UD
-0.69
Mods
-0.68
onne
-0.67
POSITIVE LOGITS
nonetheless
1.38
nevertheless
1.22
etheless
0.87
retains
0.83
retaining
0.82
retained
0.81
acknow
0.80
cautioned
0.78
alas
0.76
still
0.72
Activations Density 0.202%