INDEX
Explanations
terms related to comparative measurements or analysis
New Auto-Interp
Negative Logits
inite
-0.15
tá»Ń
-0.15
pliant
-0.15
ullan
-0.15
kees
-0.15
inan
-0.15
zburg
-0.14
insky
-0.14
onis
-0.14
oola
-0.14
POSITIVE LOGITS
intermediate
0.32
Intermediate
0.28
Intermediate
0.26
intermedi
0.24
intermediary
0.24
neither
0.17
interim
0.17
Between
0.17
BETWEEN
0.17
intervening
0.17
Activations Density 0.137%