INDEX
Explanations
phrases highlighting comparisons or contrasts
New Auto-Interp
Negative Logits
ToBounds
-0.90
Gunn
-0.71
meningitis
-0.71
uuidv
-0.67
lid
-0.65
ActivityCompat
-0.65
labelledby
-0.64
Williamson
-0.64
ظيم
-0.64
Hass
-0.63
POSITIVE LOGITS
uttosto
1.14
Rather
0.96
Rather
0.92
rather
0.84
rather
0.84
Vrij
0.73
THON
0.73
Fairly
0.71
Notion
0.71
terly
0.71
Activations Density 0.081%