Explanations
phrases related to comparisons or the act of comparing different things
phrases related to comparison or evaluating differences
Negative Logits
oÄŁ        -0.79
ther       -0.68
dar        -0.67
havoc      -0.66
gren       -0.66
jong       -0.65
ULE        -0.65
eland      -0.65
ktop       -0.65
starter    -0.64
Positive Logits
apples         1.27
favorably      1.18
comparing      0.87
favour         0.84
between        0.82
isons          0.77
comparisons    0.77
notes          0.75
compare        0.74
Compare        0.74
Activations Density 0.046%