INDEX
Explanations
verbs in various forms indicating states of being or actions related to individuals and their circumstances
New Auto-Interp
Negative Logits
ance
-0.18
acr
-0.17
smo
-0.15
amar
-0.15
Memorial
-0.14
ão
-0.14
hee
-0.14
Cli
-0.14
iedo
-0.14
upp
-0.14
POSITIVE LOGITS
ÃĹ↵↵
0.16
nda
0.15
itzer
0.15
izophren
0.15
.googleapis
0.15
kå
0.15
iscard
0.15
tens
0.15
Urb
0.14
nde
0.14
Activations Density 0.214%