INDEX
Explanations
references to comparisons or duality between subjects
Negative Logits
Yucatán   -0.68
er        -0.67
Smoky     -0.65
های       -0.64
residue   -0.63
Amelia    -0.63
Creole    -0.62
Rena      -0.61
monica    -0.60
ckley     -0.60
Positive Logits
both      2.06
BOTH      1.98
both      1.94
Both      1.86
Both      1.84
BOTH      1.75
Ambos     1.61
beide     1.46
ambos     1.40
beider    1.38
Activations Density 0.096%