Explanations
the word "Ir" with a high activation value
mentions of "Ir" (likely a reference to a specific character or entity) throughout the document
Negative Logits
pop        -0.73
welcome    -0.70
pop        -0.70
solo       -0.70
ality      -0.65
chambers   -0.65
popping    -0.65
ango       -0.64
MON        -0.64
alog       -0.63
Positive Logits
Ir         3.70
Ir         2.56
ir         1.44
Il         1.30
Er         1.30
IR         1.28
Irving     1.28
Kir        1.16
Imper      1.15
Irwin      1.15
Activations Density: 0.017%