INDEX
Explanations
capital letters and numbers in specific sequences
specific letter or number sequences and their representations within a structured data context
New Auto-Interp
Negative Logits
ification
-0.66
has
-0.63
rollers
-0.60
estate
-0.59
Socialism
-0.59
lif
-0.58
lihood
-0.58
collection
-0.58
think
-0.55
offic
-0.55
POSITIVE LOGITS
OTO
1.09
NW
1.01
ERO
0.99
UE
0.99
-.
0.97
RL
0.97
VI
0.95
OPA
0.93
II
0.92
EGIN
0.92
Activations Density 0.110%