INDEX
Explanations
narrow or specific terms
instances of the word "narrow" or related terms that convey narrowness
New Auto-Interp
Negative Logits
ICAN
-0.78
CF
-0.75
YL
-0.74
FORE
-0.73
IRED
-0.73
natureconservancy
-0.71
Destruction
-0.70
Mobil
-0.69
è¦ļéĨĴ
-0.68
NSA
-0.68
POSITIVE LOGITS
minded
0.98
narrow
0.94
sided
0.93
creen
0.93
sighted
0.85
confines
0.84
minded
0.83
cut
0.82
band
0.82
strokes
0.81
Activations Density 0.019%