INDEX
Explanations
quantifiers indicating frequency or prevalence
New Auto-Interp
Negative Logits
Isabella
-0.50
Belt
-0.47
Isabella
-0.47
ring
-0.42
mjs
-0.41
belt
-0.40
Belt
-0.40
Museum
-0.40
Download
-0.39
anel
-0.38
POSITIVE LOGITS
antMatchers
0.75
fleste
0.69
ždý
0.62
flesta
0.61
Jeder
0.56
meeste
0.56
plupart
0.56
kebanyakan
0.56
meisten
0.55
Cualquier
0.55
Activations Density 0.406%