INDEX
Explanations
Repeated phrases and references to actions, particularly those beginning with "to."
Negative Logits
jenigen          -0.54
lapsingToolbar   -0.51
ьере             -0.51
loem             -0.50
jenige           -0.48
חיצוניים         -0.47
angekommen       -0.46
jaros            -0.45
ⓘ                -0.45
akrab            -0.45
Positive Logits
perfection       0.85
death            0.83
فريبيس           0.75
propOrder        0.70
death            0.69
pieces           0.65
DEATH            0.63
shreds           0.63
Seeder           0.63
distraction      0.62
Activations Density 0.164%