INDEX
Explanations
unique or special items referred to as "Original" or "original."
repeated mentions of the term "Original" in various contexts
New Auto-Interp
Negative Logits
wal
-0.85
ega
-0.71
rosso
-0.71
roph
-0.69
ostics
-0.68
ucket
-0.67
robe
-0.67
rolet
-0.66
rology
-0.66
walk
-0.65
POSITIVE LOGITS
ity
1.42
ITY
0.92
incarnation
0.88
sin
0.86
impetus
0.86
intent
0.83
conception
0.81
trilogy
0.81
screenplay
0.80
intention
0.79
Activations Density 0.028%