INDEX
Explanations
assertions of excellence and high quality in various contexts
positive descriptors
New Auto-Interp
Negative Logits
bigger
-0.52
sterious
-0.50
funnier
-0.47
big
-0.42
bigger
-0.42
softer
-0.41
wetter
-0.41
biggest
-0.40
Biggest
-0.40
BIG
-0.39
POSITIVE LOGITS
Excellent
1.16
Excellent
1.16
excellent
1.14
excellent
1.11
excelentes
1.01
excelente
0.98
excelente
0.95
eccellente
0.94
excellente
0.91
excell
0.91
Activations Density 0.018%