INDEX
Explanations
words related to objects and actions in the air
references to flying or airborne elements
New Auto-Interp
Negative Logits
anan
-0.73
Priv
-0.70
Ô
-0.64
fav
-0.60
¿½
-0.60
iago
-0.59
compl
-0.59
edIn
-0.59
ior
-0.59
riv
-0.59
POSITIVE LOGITS
waves
0.98
walk
0.79
canopy
0.76
ceiling
0.75
somewhere
0.74
walker
0.72
undet
0.67
ousel
0.66
continuum
0.66
aisle
0.65
Activations Density 0.081%