INDEX
Explanations
punctuation marks at the end of a sentence
phrases related to match-ups or comparisons
New Auto-Interp
Negative Logits
migrated
-0.76
steered
-0.73
affected
-0.73
iosyn
-0.72
danced
-0.71
unden
-0.71
wcs
-0.69
enthusi
-0.69
ande
-0.68
ancest
-0.68
POSITIVE LOGITS
nil
0.75
jpg
0.68
ours
0.68
TBD
0.67
Slay
0.66
Typh
0.66
illian
0.65
ItemTracker
0.65
luaj
0.64
mast
0.64
Activations Density 0.028%