INDEX
Explanations
specific technical features or attributes indicated by numerical values
possessive verbs indicating ownership or attributes
New Auto-Interp
Negative Logits
TG
-0.66
Interested
-0.65
endeavour
-0.64
FW
-0.63
iss
-0.62
iling
-0.60
ensing
-0.59
YING
-0.59
MO
-0.58
fty
-0.58
POSITIVE LOGITS
been
1.57
undergone
1.46
been
1.32
Been
1.17
become
1.14
survived
1.12
existed
1.12
gotten
1.06
stood
1.06
arisen
1.04
Activations Density 0.187%