INDEX
Explanations
strings of text ending with a special character "Ċ" followed by a numeric value indicating the strength of activation
phrases that emphasize strong emotions or personal preferences
New Auto-Interp
Negative Logits
Fu
-0.66
flats
-0.61
senal
-0.59
Alb
-0.59
manned
-0.59
wearer
-0.56
Skydragon
-0.56
Hispan
-0.55
Racer
-0.55
mansion
-0.54
POSITIVE LOGITS
assian
0.83
↵
0.81
DragonMagazine
0.79
until
0.78
³³³
0.77
database
0.75
termin
0.74
̶
0.74
³³³³³³³³³³³³³³³³
0.74
til
0.73
Activations Density 0.180%