Explanations
elements related to interpersonal relationships and character dynamics
start of turn
Negative Logits
myſelf  -0.58
ligiloj  -0.55
scribers  -0.55
auffi  -0.51
ſelf  -0.50
يميديا  -0.50
Parcelize  -0.50
fubject  -0.50
zoude  -0.49
ilustracja  -0.48
Positive Logits
freaking  0.48
fucking  0.46
tbh  0.46
freakin  0.45
FUCKING  0.44
Fisch  0.44
humanity  0.42
Figue  0.42
fuckin  0.42
🤷  0.42
Activations Density 0.068%