INDEX
Explanations
comparative phrases that contrast two concepts or ideas
New Auto-Interp
Negative Logits
habet
-0.58
étoient
-0.55
že
-0.55
avoient
-0.52
ſmall
-0.51
verständlich
-0.51
olympique
-0.50
fevere
-0.50
abrasion
-0.49
solidarité
-0.49
POSITIVE LOGITS
raczej
0.95
скорее
0.89
eher
0.79
unknownFields
0.78
рее
0.73
inkább
0.71
חיצוניים
0.69
NameInMap
0.68
,:]
0.68
rather
0.67
Activations Density 0.158%