INDEX
Explanations
phrases related to observation or perception
New Auto-Interp
Negative Logits
whistle
-0.73
onto
-0.68
inqu
-0.67
nesia
-0.67
eware
-0.66
ÃŃa
-0.64
agric
-0.63
EStreamFrame
-0.63
trap
-0.62
prime
-0.60
POSITIVE LOGITS
dust
1.18
firsthand
0.92
parallels
0.87
awed
0.86
visions
0.86
lights
0.83
eye
0.83
Ĺ
0.79
fit
0.78
similarities
0.76
Activations Density 0.514%