Explanations
mentions of specific organizations or groups and their positions or actions
phrases that include commas, likely indicating lists or complex structures in sentences
Negative Logits
animate           -0.69
DragonMagazine    -0.63
-,                -0.58
ãĥij               -0.58
worldly           -0.57
deed              -0.57
rats              -0.56
fw                -0.56
redundant         -0.54
ðŁĻĤ              -0.54
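Two entries in the negative-logit list, `ãĥij` and `ðŁĻĤ`, look like raw vocabulary strings from a byte-level BPE tokenizer rather than readable text. The sketch below shows one way to decode such strings, assuming (this is not stated in the source) that the underlying model uses the GPT-2 style byte-to-unicode mapping. The helper names `build_byte_table` and `decode_token` are hypothetical and exist only for this illustration.

```python
def build_byte_table():
    """Rebuild the GPT-2 style byte -> printable-character table.

    Assumption: the tokenizer behind this dashboard uses this mapping.
    """
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(33, 127))     # '!' .. '~'
        + list(range(161, 173))  # U+00A1 .. U+00AC
        + list(range(174, 256))  # U+00AE .. U+00FF
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(cp) for b, cp in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in build_byte_table().items()}


def decode_token(token: str) -> str:
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # Partial multi-byte sequences are shown as replacement characters.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥij", "ðŁĻĤ", "says"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ðŁĻĤ` decodes to the slightly-smiling-face emoji (U+1F642) and `ãĥij` to the katakana character パ (U+30D1), while plain ASCII tokens such as `says` pass through unchanged.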
Positive Logits
disagrees         1.13
agrees            1.07
tells             1.07
says              1.06
told              1.05
said              1.04
argues            1.04
testified         1.03
believes          1.03
contends          1.01
Activations Density 0.138%