INDEX
Explanations
phrases with emphasis on the quantity or exclusivity of certain objects or concepts
phrases that indicate possession or existence
New Auto-Interp
Negative Logits
TG
-0.74
ensing
-0.72
SHIP
-0.68
territ
-0.66
Interested
-0.63
Tweet
-0.62
PHOTOS
-0.61
bart
-0.59
eem
-0.58
responsible
-0.58
POSITIVE LOGITS
been
1.35
undergone
1.19
been
1.17
Been
1.07
kell
1.02
existed
0.98
similarities
0.98
become
0.96
implications
0.95
arisen
0.95
Activations Density 0.303%