INDEX
Explanations
instances of back-and-forth motion or actions involving transitions
New Auto-Interp
Negative Logits
Administrativna
-0.52
urger
-0.47
Shuk
-0.46
acidic
-0.46
Mercantile
-0.45
islanders
-0.44
CanadaChoose
-0.44
Aimee
-0.43
ernet
-0.43
ferrer
-0.43
POSITIVE LOGITS
BATH
0.95
bath
0.94
BATH
0.93
Bath
0.92
mth
0.91
Bath
0.91
bath
0.91
ETH
0.90
meth
0.88
Beth
0.88
Activations Density 0.695%