INDEX
Explanations
instances of individuals or topics being publicly discussed or brought to attention
New Auto-Interp
Negative Logits
ongyang
-0.16
mare
-0.15
Riy
-0.14
Lebens
-0.14
ayne
-0.14
iyon
-0.14
emphasis
-0.13
á»ijt
-0.13
observeOn
-0.13
γά
-0.13
POSITIVE LOGITS
publicly
0.20
pública
0.19
about
0.17
.public
0.16
public
0.16
openly
0.15
púb
0.15
public
0.15
/public
0.15
(public
0.14
Activations Density 0.080%