INDEX
Explanations
phrases where a statement is emphasized or confirmed
repeated phrases indicating agreement or affirmation
New Auto-Interp
Negative Logits
ĸļ
-0.69
arette
-0.65
ripp
-0.62
mat
-0.61
ains
-0.61
igmat
-0.60
ature
-0.60
ipation
-0.60
è¦ļéĨĴ
-0.60
odium
-0.59
POSITIVE LOGITS
eous
1.22
wing
0.79
wing
0.79
move
0.76
shore
0.76
winger
0.70
å¾
0.69
hand
0.68
mares
0.68
fielder
0.68
Activations Density 0.053%