INDEX
Explanations
phrases related to various individuals and proper nouns
frequent mentions of the word "ra" and references to economic figures and entities
New Auto-Interp
Negative Logits
kson
-0.71
izabeth
-0.68
ocene
-0.64
Awakens
-0.63
ĩ
-0.62
Aval
-0.60
Papua
-0.60
Matthews
-0.60
OTOS
-0.59
Swanson
-0.59
POSITIVE LOGITS
sidx
0.88
fters
0.87
aca
0.84
eal
0.83
ental
0.81
ught
0.79
asar
0.79
idem
0.78
fide
0.76
inals
0.76
Activations Density 0.051%