INDEX
Explanations
numerical lists or bullet points
lists or enumerations of items
Negative Logits
scenes    -0.67
awaru     -0.67
espie     -0.67
allowed   -0.66
manag     -0.62
sway      -0.62
leading   -0.61
hedon     -0.61
wagen     -0.60
ethics    -0.60
Positive Logits
Password      1.28
st            1.14
Corinthians   1.12
125           1.00
120           0.98
123           0.96
½             0.88
000000        0.87
128           0.86
ST            0.82
Activations Density 0.049%