INDEX
Explanations
words related to physical movement, specifically the action of swiping
occurrences of the word "sw" in various contexts
New Auto-Interp
Negative Logits
inelli
-0.86
onics
-0.83
endment
-0.80
OPA
-0.76
Constantin
-0.74
viation
-0.74
oglu
-0.74
otaur
-0.73
Church
-0.71
exist
-0.71
POSITIVE LOGITS
sw
3.64
Sw
1.95
sw
1.77
Sw
1.75
swat
1.65
swipe
1.43
swim
1.33
swarm
1.28
SW
1.25
swamp
1.22
Activations Density 0.010%