Explanations
references to the sun and sunlight
Negative Logits
Token       Logit
Halls       -0.73
axter       -0.70
ngth        -0.69
ABE         -0.69
ussen       -0.69
ij士        -0.68
ourgeois    -0.68
Mellon      -0.67
USS         -0.67
Hodg        -0.67
Positive Logits
Token       Logit
flower      1.35
shine       1.29
burst       1.24
lit         1.19
beam        1.18
nah         1.14
rises       1.13
burn        1.11
spot        1.06
bat         1.05
Activations Density: 0.022%