INDEX
Explanations
words related to size or magnitude
Negative Logits
Token            Logit
classNames       -0.66
epiece           -0.65
sanitarias       -0.63
alluminio        -0.61
professionale    -0.61
poésie           -0.61
estatales        -0.61
esterna          -0.60
titian           -0.60
során            -0.59
Positive Logits
Token            Logit
big              2.73
huge             2.15
big              2.01
biggest          1.90
bigger           1.88
BIG              1.77
huge             1.75
HUGE             1.72
large            1.72
biggest          1.66
Activations Density: 0.049%