Explanations
phrases and structures indicating relationships and dynamics among characters or entities
Negative Logits
argas       -0.16
iez         -0.15
aina        -0.15
eck         -0.15
Gambling    -0.15
assadors    -0.15
ief         -0.14
beck        -0.14
izontal     -0.14
被          -0.14
Positive Logits
going       0.42
going       0.35
Going       0.29
gonna       0.28
-g          0.27
-going      0.27
Going       0.27
gon         0.26
gun         0.25
g           0.24
Activations Density: 0.098%