INDEX
Explanations
phrases related to making decisions or choices
references to competition or selection processes
New Auto-Interp
Negative Logits
unctions
-0.73
venants
-0.72
ossibility
-0.65
unction
-0.65
quartered
-0.64
llah
-0.63
ouble
-0.63
enegger
-0.62
urry
-0.62
taboola
-0.61
POSITIVE LOGITS
liest
0.87
versus
0.81
?",
0.79
?,
0.77
vs
0.72
depends
0.69
FN
0.68
hardest
0.68
Accessory
0.67
weakest
0.65
Activations Density 0.271%