INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
uckland
-0.99
otton
-0.74
ishable
-0.72
è¦ļéĨĴ
-0.70
ushima
-0.68
animous
-0.67
arsity
-0.67
uggle
-0.66
ãĤĵ
-0.66
ttp
-0.66
POSITIVE LOGITS
like
1.12
LIKE
0.87
geist
0.81
likes
0.81
akin
0.80
liking
0.79
Like
0.77
boom
0.74
liked
0.72
unlike
0.69
Activations Density 0.000%
No Known Activations
This feature has no known activations.