INDEX
Explanations
adverbs used to convey intensity or emphasis
expressions related to contemplation and evaluation
New Auto-Interp
Negative Logits
adra
-0.79
catentry
-0.71
ante
-0.66
edition
-0.64
ario
-0.63
%"
-0.61
é»Ĵ
-0.60
ource
-0.60
endor
-0.58
riott
-0.58
POSITIVE LOGITS
Bout
0.74
unthinkable
0.66
provoking
0.66
oby
0.64
aloud
0.64
ti
0.64
cho
0.63
mas
0.63
orth
0.62
esian
0.62
Activations Density 0.132%