INDEX
Explanations
adjectives and adverbs with specific endings
New Auto-Interp
Negative Logits
Riders
-0.80
Newsp
-0.71
Royals
-0.70
Chiefs
-0.69
RJ
-0.69
anche
-0.69
Seasons
-0.68
Pirate
-0.67
OC
-0.66
anca
-0.66
POSITIVE LOGITS
inclined
0.89
Gleaming
0.89
minded
0.88
ãĤ©
0.87
speaking
0.75
ascus
0.74
opposed
0.72
diverse
0.72
indistinguishable
0.71
Ü
0.70
Activations Density 0.016%