INDEX
Explanations
phrases related to criteria and processes for selection or assessment
New Auto-Interp
Negative Logits
ters
-0.15
rze
-0.15
нÑıÑĤи
-0.14
³
-0.13
ãĤĤãģĨ
-0.13
ÑĪе
-0.13
kö
-0.13
atron
-0.13
intr
-0.13
dept
-0.13
POSITIVE LOGITS
especially
0.28
particularly
0.24
especially
0.21
оÑģобенно
0.20
especialmente
0.20
pecially
0.19
çī¹åĪ«
0.18
particularly
0.17
оÑģобливо
0.17
together
0.16
Activations Density 0.075%