INDEX
Explanations
mentions of rags or objects made of rags
variations of the word "rag."
New Auto-Interp
Negative Logits
ĨĴ
-0.83
erves
-0.68
ĺ
-0.65
iership
-0.65
ight
-0.64
chart
-0.63
idential
-0.62
ĺħ
-0.61
KER
-0.60
renovations
-0.60
POSITIVE LOGITS
shaw
0.95
azine
0.92
eworks
0.91
MENTS
0.90
nir
0.87
lan
0.86
asso
0.82
netic
0.82
da
0.81
rag
0.81
Activations Density 0.038%