Explanations
entities related to statements or quotations
variations of the word "is"
Negative Logits
Token        Logit
Gmail        -0.70
notebooks    -0.68
decap        -0.68
illiter      -0.67
decomp       -0.67
pyramid      -0.64
mainland     -0.64
parap        -0.64
misdem       -0.63
liter        -0.63
Positive Logits
Token        Logit
s            1.16
İ            0.89
ski          0.88
ship         0.87
ï¸ı          0.86
else         0.85
IJ           0.84
bly          0.84
save         0.82
d            0.79
Activations Density: 0.207%