INDEX
Explanations
phrases indicating understanding or clarity regarding concepts
"make sense" or variations
make sense phrases
New Auto-Interp
Negative Logits
nahilalakip
-0.75
&___
-0.66
oprot
-0.64
saites
-0.62
متعلقه
-0.61
Heights
-0.56
للمعارف
-0.56
ulet
-0.55
RegressionTest
-0.55
titleMargin
-0.53
POSITIVE LOGITS
sense
2.31
sense
1.80
SENSE
1.57
sentido
1.49
Sense
1.48
Sense
1.38
senso
1.28
senses
1.21
sens
1.09
ense
0.96
Activations Density 0.130%