Explanations
references to pairs or groups, specifically focusing on the word "two" in various contexts
Negative Logits
alguno            -0.72
flere             -0.70
chrétiens         -0.66
âmes              -0.66
<<<<<<<<<<<<<<    -0.64
faciles           -0.63
sauvages          -0.62
particuliers      -0.62
flera             -0.62
scolaires         -0.61
Positive Logits
remaining         0.86
Roskov            0.82
aforementioned    0.76
closest           0.76
remaining         0.75
largest           0.71
OCCURRED          0.70
iniest            0.70
closest           0.69
poorest           0.67
Activations Density: 0.195%