Explanations
references to the sun and related solar phenomena
Negative Logits
Token             Logit
עת                -0.69
=&                -0.67
---*/             -0.66
configureStore    -0.66
hringer           -0.64
osť               -0.63
zegovina          -0.61
]];               -0.61
tawesome          -0.61
Rasa              -0.61
Positive Logits
Token             Logit
Sun               2.02
SUN               1.94
Sun               1.90
sun               1.87
sun               1.83
SUN               1.80
Suns              1.57
suns              1.56
sunshine          1.38
soleil            1.37
Activations Density 0.054%
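
The page does not define how the density figure is computed. As a rough sketch, assuming it means the fraction of token positions on which the feature has a nonzero activation, the calculation could look like the following. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions on which the
    feature fires; the source page does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.054% would correspond to the feature firing on roughly 54 out of every 100,000 token positions.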