INDEX
Explanations
proper nouns related to political figures and historical events
repeated mentions of the name "Heisenberg."
New Auto-Interp
Negative Logits
permitting
-0.70
branding
-0.69
street
-0.64
sustainable
-0.63
block
-0.61
permit
-0.60
routing
-0.60
banning
-0.60
home
-0.60
Fat
-0.59
POSITIVE LOGITS
isen
5.01
izen
1.35
Eisen
1.28
iken
1.25
inen
1.19
isin
1.13
ises
1.12
osen
1.10
ISE
1.09
isa
1.09
Activations Density 0.007%