Explanations
proper names, specifically those related to individuals and their contributions in various contexts
Negative Logits
Token            Logit
irth             -0.17
ryn              -0.16
insula           -0.14
itori            -0.14
osp              -0.14
apult            -0.14
treff            -0.14
GameController   -0.14
ombok            -0.13
ambah            -0.13
Positive Logits
Token            Logit
pat              0.21
Pat              0.20
_pat             0.18
Pat              0.18
.pat             0.17
åĢī              0.17
ãĤº              0.16
(pat             0.16
pat              0.15
satur            0.15
Activations Density: 0.026%