INDEX
Explanations
phrases related to sucking
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1757
+0.09
0.3%
1921
+0.07
0.2%
1047
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
701
+0.09
0.02
1836
+0.07
0.02
1638
+0.07
0.02
Negative Logits
/***
-0.59
tortas
-0.59
pican
-0.56
telur
-0.54
żdy
-0.53
Fuckin
-0.52
iniums
-0.51
Địa
-0.51
habited
-0.50
mostaza
-0.49
POSITIVE LOGITS
suck
2.81
sucked
2.26
sucking
2.15
sucks
2.13
Suck
2.13
suck
2.06
Suck
1.80
suckers
1.26
sucker
1.20
stinks
1.14
Activations Density 0.123%