INDEX
Explanations
verbs denoting action or transition
references to specific individuals or groups and their actions
Negative Logits
âĢ¢âĢ¢       -0.66
ÃĽ           -0.65
"}],"        -0.65
Differences  -0.64
¬¼           -0.58
ieth         -0.56
soType       -0.55
Mehran       -0.54
imilation    -0.53
iencies      -0.53
Positive Logits
kinda        1.18
basically    1.14
supposedly   1.13
apparently   1.13
thankfully   1.06
reportedly   1.05
obviously    1.04
freaking     1.03
definitely   1.03
literally    1.03
Activations Density 0.857%
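
For reference, a minimal sketch of how a density figure like the one above could be computed. It assumes that "activations density" means the fraction of tokens on which the feature's activation is above zero. The page does not state this definition, and the function name, threshold, and sample values here are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (total tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations for 8 tokens; only one token is active.
sample = [0.0, 0.0, 2.3, 0.0, 0.0, 0.0, 0.0, 0.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumed definition, a density of 0.857% would mean the feature fires on roughly 1 in every 117 tokens of the evaluation text.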