INDEX
Explanations
specific actions or objects related to physical altercations or conflicts
mentions of specific objects, individuals, or significant topics within a narrative
New Auto-Interp
Negative Logits
ĪĴ
-0.53
¿½
-0.52
Emin
-0.51
Leilan
-0.49
confir
-0.48
conclud
-0.47
surpr
-0.46
Rampage
-0.46
Aires
-0.45
ãĥ´
-0.44
POSITIVE LOGITS
badge
0.56
onto
0.52
hostage
0.48
button
0.45
Seal
0.45
pes
0.45
reins
0.45
onto
0.45
curse
0.45
securely
0.44
Activations Density 1.656%