INDEX
Explanations
phrases expressing equivalence or comparisons
comparisons indicating equivalence in various contexts
New Auto-Interp
Negative Logits
stal
-0.82
hra
-0.77
spe
-0.76
stra
-0.74
oard
-0.74
Bomb
-0.68
omen
-0.67
bean
-0.67
Roads
-0.65
Mush
-0.65
POSITIVE LOGITS
ivalent
0.84
lihood
0.83
isons
0.82
imately
0.80
amounts
0.77
terday
0.77
icut
0.74
aminer
0.72
equivalent
0.72
oreal
0.72
Activations Density 0.016%