INDEX
Explanations
special characters and punctuation marks
New Auto-Interp
Negative Logits
NameInMap
-0.53
lances
-0.50
BoxFit
-0.50
recensement
-0.49
UserContext
-0.48
messageInfo
-0.48
كويكب
-0.48
Aptitude
-0.47
scal
-0.46
:+:
-0.46
POSITIVE LOGITS
Monfieur
0.82
purpoſe
0.72
ſeveral
0.72
Houſe
0.71
myſelf
0.71
Jefus
0.71
[toxicity=0]
0.69
########.
0.68
Diſ
0.67
Majefty
0.66
Activations Density 0.021%