INDEX
Explanations
the word "is" at a high activation level
instances of the verb "is."
New Auto-Interp
Negative Logits
*/
-0.63
entails
-0.63
refrain
-0.61
ependence
-0.58
know
-0.58
encounters
-0.57
urry
-0.57
wills
-0.57
itiveness
-0.56
IMAGES
-0.56
POSITIVE LOGITS
senal
1.05
rael
1.00
cussion
0.90
definitely
0.88
rumored
0.85
currently
0.83
nt
0.82
hereby
0.82
indeed
0.81
supposed
0.81
Activations Density 0.408%