INDEX
Explanations
phrases containing the word "only"
the phrase "not only," which suggests the inclusion of additional information or ideas
New Auto-Interp
Negative Logits
»Ĵ
-0.79
Hanson
-0.70
Nanto
-0.70
Fallen
-0.67
ij士
-0.65
lain
-0.65
largeDownload
-0.63
Bers
-0.62
glomer
-0.61
insert
-0.61
POSITIVE LOGITS
oped
0.67
ifiable
0.64
obe
0.64
othing
0.62
onew
0.61
cki
0.61
tem
0.60
verbally
0.60
interested
0.60
physically
0.59
Activations Density 0.027%