INDEX
Explanations
mentions of specific locations and events
instances of the character "L" or related terms in various contexts
New Auto-Interp
Negative Logits
decomp
-0.79
JPEG
-0.70
scrim
-0.66
comb
-0.65
contempor
-0.65
photoc
-0.63
elim
-0.63
compost
-0.63
metaphor
-0.62
pyramid
-0.61
POSITIVE LOGITS
s
1.59
ski
1.15
ship
1.08
sf
1.07
tal
1.06
sen
1.02
ses
1.01
ships
1.00
sg
1.00
sin
0.99
Activations Density 0.207%