INDEX
Explanations
direct speech examples containing the word 'just'
Negative Logits
Token        Logit
ccording     -0.71
Palestin     -0.69
challeng     -0.66
Archdemon    -0.64
subsequ      -0.63
holiest      -0.62
der          -0.61
Nurs         -0.58
supervised   -0.58
PLUS         -0.58
Positive Logits
Token        Logit
ifiable      1.29
ifications   1.16
ification    0.98
ify          0.97
ified        0.96
if           0.94
ignore       0.90
ifi          0.89
plain        0.86
ifiers       0.84
Activations Density 0.072%