INDEX
Explanations
adverbs that indicate degree or intensity
Negative Logits
ĺħ      -0.78
eday    -0.78
ODY     -0.73
Tags    -0.68
OGR     -0.68
selage  -0.67
Korea   -0.66
arella  -0.66
ydia    -0.65
OTOS    -0.65
Positive Logits
divisive       0.86
lethal         0.81
destructive    0.81
chaotic        0.81
controversial  0.80
mythical       0.80
abras          0.80
embattled      0.80
politic        0.80
infamous       0.80
Activations Density 0.077%