INDEX
Explanations
the occurrence of the word "Global" at varying activation levels in the text
references to the term "Global" in various contexts
New Auto-Interp
Negative Logits
chers
-0.83
manship
-0.81
INO
-0.76
FUL
-0.76
ÄŁ
-0.75
oÄŁ
-0.75
Ô
-0.74
deen
-0.74
_-
-0.72
HOU
-0.72
POSITIVE LOGITS
ization
0.99
Position
0.88
izing
0.88
warming
0.88
warming
0.87
Witness
0.85
izable
0.84
Affairs
0.84
isation
0.84
Entry
0.81
Activations Density 0.020%