INDEX
Explanations
phrases involving assessment or evaluation
New Auto-Interp
Negative Logits
ona
-0.16
ardu
-0.16
sequential
-0.16
apolis
-0.15
xDA
-0.15
-FIRST
-0.14
imits
-0.14
ard
-0.14
Bullet
-0.14
arg
-0.14
POSITIVE LOGITS
INGER
0.16
eliac
0.16
ex
0.15
opensource
0.14
elen
0.14
bow
0.14
вав
0.14
elli
0.14
ingles
0.14
IRR
0.14
Activations Density 0.004%