INDEX
Explanations
phrases related to physical structures or formations
phrases indicating abundance or quantity
Negative Logits
acus       -0.80
stride     -0.72
oking      -0.70
hra        -0.69
riber      -0.68
century    -0.68
jad        -0.68
tarians    -0.67
ibe        -0.67
bowl       -0.66
Positive Logits
sorts      1.28
goodies    0.86
assorted   0.77
pixels     0.75
extras     0.74
colorful   0.74
disparate  0.74
stars      0.72
rubble     0.71
seats      0.70
Activations Density: 0.122%