Explanations
attributes related to condition and quality assessments
Negative Logits
'*':        -0.90
—           -0.87
NDEBUG      -0.84
'-':        -0.83
الحياه      -0.80
imidlertid  -0.79
―――――       -0.78
[],         -0.76
freilich    -0.74
ſind        -0.73
Positive Logits
ect        0.95
aswell     0.88
alot       0.83
thats      0.79
etc        0.77
haha       0.77
atleast    0.67
&          0.67
INCLUDING  0.67
BUT        0.66
Activations Density: 0.993%