INDEX
Explanations
phrases that imply inquiry or confusion
New Auto-Interp
Negative Logits
bara
-0.92
termination
-0.75
abase
-0.74
alore
-0.69
etheless
-0.69
perial
-0.67
chwitz
-0.66
anian
-0.66
farm
-0.66
terness
-0.66
POSITIVE LOGITS
Atk
0.72
Sort
0.69
Sort
0.68
Type
0.68
é¾įå
0.67
Likes
0.65
Bust
0.65
ivals
0.64
Prev
0.63
Favorite
0.62
Activations Density 0.009%