Explanations
names and specific identifiers related to individuals and entities
ind/ink + er/ver
Negative Logits
Token     Logit
={({      -0.58
ztály     -0.56
didSet    -0.55
epa       -0.53
Eq        -0.53
sApp      -0.52
者        -0.50
$\#       -0.49
ábado     -0.49
Koval     -0.49
Positive Logits
Token     Logit
int       0.64
inton     0.63
Wink      0.63
ind       0.61
ink       0.61
inn       0.61
into      0.60
inta      0.60
inde      0.59
inthe     0.59
Activations Density: 0.052%