Explanations
adverbs that express an opinion or evaluation
Negative Logits
Token         Logit
ories         -0.64
aspiration    -0.62
SourceFile    -0.59
Variant       -0.58
mates         -0.58
CLOSE         -0.57
(>            -0.57
nect          -0.56
MAL           -0.56
/             -0.56
Positive Logits
Token           Logit
anecd           0.73
unlike          0.71
according       0.67
concedes        0.67
adle            0.67
âķIJâķIJ          0.66
ignores         0.66
requires        0.64
has             0.64
acknowledges    0.63
Activations Density: 0.102%