INDEX
Explanations
references to the sun
references to the sun
New Auto-Interp
Negative Logits
abulary
-0.78
Mellon
-0.75
ergus
-0.75
remlin
-0.74
icient
-0.73
Cosponsors
-0.73
ourgeois
-0.72
emetery
-0.71
razil
-0.71
kHz
-0.68
POSITIVE LOGITS
lit
1.06
sun
1.04
shine
0.99
rays
0.97
nah
0.93
light
0.90
rays
0.87
beam
0.86
lou
0.86
burn
0.84
Activations Density 0.009%