Explanations
phrases indicating repetition or routine
Negative Logits
rell        -0.17
rel         -0.14
ãĥ³ãĤ¸      -0.14
ISTER       -0.14
egas        -0.14
ALT         -0.14
DATES       -0.14
غات         -0.14
matcher     -0.14
Rank        -0.14
Positive Logits
bai         0.17
jal         0.15
bite        0.14
eins        0.14
census      0.14
izik        0.14
iger        0.14
ncia        0.14
ols         0.14
bite        0.13
Activations Density 0.045%