INDEX
Explanations
phrases related to exploration and investigation
terms related to measurement and quantities
New Auto-Interp
Negative Logits
Gladiator
-0.82
Brush
-0.81
Blossom
-0.78
Painter
-0.78
Cousins
-0.77
Rampage
-0.77
Clouds
-0.77
Zuckerberg
-0.76
Exploration
-0.76
Shock
-0.76
POSITIVE LOGITS
ma
1.64
mi
1.54
mu
1.52
nic
1.49
li
1.46
cas
1.45
ja
1.45
sam
1.45
nu
1.45
il
1.45
Activations Density 0.108%