INDEX
Explanations
phrases related to measuring the size and shape of objects
occurrences of the word "and" related to various contexts
Negative Logits
gets            -0.85
rs              -0.74
Pigs            -0.73
Kirin           -0.72
aroo            -0.70
hov             -0.69
atican          -0.69
ouls            -0.68
wives           -0.68
KK              -0.68
Positive Logits
likeness        1.15
temperament     0.98
uniqueness      0.96
characteristics 0.94
stature         0.94
texture         0.93
elevation       0.92
heterogeneity   0.92
geography       0.91
composition     0.90
Activations Density 0.358%