INDEX
Explanations
negative sentiments or statements regarding efficacy or success
New Auto-Interp
Negative Logits
EndProject
-0.52
/*
-0.50
unique
-0.48
tab
-0.43
Camp
-0.41
صوتيه
-0.41
rare
-0.41
ites
-0.41
="@+
-0.40
Nutz
-0.39
POSITIVE LOGITS
still
1.02
Still
0.97
still
0.92
Still
0.90
STILL
0.90
STILL
0.90
ſtill
0.89
ändå
0.83
trotzdem
0.79
ftill
0.77
Activations Density 0.397%