Explanations
the word "rather" in various contexts and forms
Negative Logits
Token       Logit
s           -0.17
rades       -0.17
ury         -0.16
isphere     -0.15
nad         -0.15
Anton       -0.14
sr          -0.14
ilater      -0.14
Bulk        -0.14
URY         -0.14
Positive Logits
Token       Logit
ìĦľëĬĶ      0.17
indr        0.17
oner        0.16
»¿          0.15
atik        0.15
ovÃŃ        0.15
rather      0.15
rather      0.15
abic        0.14
.ly         0.14
Activations Density: 0.015%