INDEX
Explanations
mentions of specific events or people's names
occurrences of the name "atra" and related references, likely within a political or social context
Negative Logits
GF        -0.70
igating   -0.69
Trend     -0.68
PG        -0.67
rats      -0.67
picking   -0.67
ective    -0.67
Stam      -0.66
suspic    -0.66
onet      -0.65
Positive Logits
é¾įåĸļ士   0.85
issance   0.83
agne      0.80
zzo       0.80
Mae       0.77
phrine    0.76
sonian    0.75
bows      0.75
enei      0.74
Jinn      0.72
Activations Density 0.020%