INDEX
Explanations
references to the concept of "couples" or "pairing"
New Auto-Interp
Negative Logits
Lilian
-0.74
TEXT
-0.74
Dia
-0.74
hintText
-0.72
Eichen
-0.69
ness
-0.68
Benn
-0.68
Dia
-0.68
MSR
-0.67
SLS
-0.66
POSITIVE LOGITS
couple
1.79
Couple
1.73
couple
1.71
Couple
1.62
couples
1.46
Couples
1.39
couples
1.30
COU
1.26
casal
1.06
COUP
1.01
Activations Density 0.055%