INDEX
Explanations
phrases related to comparison, competition, and standards
phrases that involve a comparative or competitive context
New Auto-Interp
Negative Logits
Appeal
-0.60
igmat
-0.59
Profile
-0.56
":["
-0.55
Edited
-0.55
anism
-0.55
Completed
-0.55
â̦]
-0.53
abal
-0.52
Quote
-0.52
POSITIVE LOGITS
sticks
0.73
roses
0.70
devil
0.69
fishes
0.67
chips
0.65
hell
0.63
heck
0.62
teeth
0.60
Roses
0.59
punches
0.59
Activations Density 0.724%