INDEX
Explanations
words containing the substring "irm"
variations of the word "affirm" and its derivatives
Negative Logits
REDACTED        -0.75
ç«              -0.67
å¹              -0.66
Cong            -0.63
Rus             -0.62
å¥              -0.62
largeDownload   -0.62
Ree             -0.62
lihood          -0.62
HY              -0.61
Positive Logits
irm             1.75
irms            1.10
irmed           1.00
ament           0.99
atively         0.97
etric           0.95
irmation        0.91
aton            0.91
ative           0.89
ities           0.86
Activations Density 0.007%