INDEX
Explanations
comparisons between different groups, with a focus on particular groups being compared to their counterparts
references to counterparts in various contexts, emphasizing comparisons or relationships between groups or entities
Negative Logits
aping     -0.76
cer       -0.73
cloth     -0.73
raz       -0.71
wood      -0.68
estern    -0.68
frey      -0.68
Fulton    -0.65
ousel     -0.65
Chain     -0.62
Positive Logits
counterparts      1.16
counterpart       1.02
hip               0.89
DragonMagazine    0.88
reluct            0.79
itutes            0.78
itute             0.78
isons             0.76
thereof           0.74
MpServer          0.74
Activations Density: 0.007%