INDEX
Explanations
descriptors related to effort, difficulty, or intensity
Negative Logits
Token        Logit
Monat        -0.49
cejas        -0.46
MenuItem     -0.45
burbujas     -0.44
Figure       -0.42
forsø        -0.42
bricolaje    -0.42
disfrute     -0.41
ípio         -0.41
Figuren      -0.40
Positive Logits
Token        Logit
Hard         0.90
Hard         0.88
faſt         0.86
Hardin       0.78
soft         0.78
HARD         0.77
Soft         0.77
Fast         0.75
httphttps    0.75
Soft         0.74
Activations Density: 0.098%