INDEX
Explanations
significant nouns and verbs that indicate importance or attention in context
Negative Logits
Teddy        -0.16
/by          -0.16
iT           -0.15
itr          -0.14
Tir          -0.14
aset         -0.14
terr         -0.14
_approved    -0.13
Dram         -0.13
Juda         -0.13
Positive Logits
leted        0.17
ンズ         0.15
ίκ           0.15
ousse        0.15
etten        0.15
τεύ          0.15
orest        0.14
думку        0.14
elman        0.14
ност         0.14
Activations Density 0.030%