INDEX
Explanations
phrases indicating personal or subjective ownership or involvement
possessive pronouns and phrases indicating ownership or personal connection
Negative Logits
Token        Logit
龍喚士       -0.62
Analytics    -0.62
irect        -0.62
icity        -0.62
HTTPS        -0.61
Recon        -0.60
Disclosure   -0.60
Returns      -0.58
Accessed     -0.57
endo         -0.57
Positive Logits
Token        Logit
fault        1.33
thing        0.97
cup          0.96
Fault        0.96
liking       0.92
problem      0.89
usual        0.87
style        0.84
intention    0.80
idea         0.80
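The source does not say how the logit values above were computed. On dashboards of this kind they are often obtained by projecting the feature's output direction through the model's unembedding matrix, so the sketch below assumes that approach. The function names, the two-dimensional vectors, and the toy numbers are all hypothetical and are not taken from the model behind this page.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Dot the feature's output direction with each token's unembedding vector.

    Assumption: `unembedding` holds one vector per vocabulary token, in the
    same order as `vocab`. This layout is illustrative only.
    """
    effects = {}
    for token, token_vector in zip(vocab, unembedding):
        effects[token] = sum(d * u for d, u in zip(decoder_direction, token_vector))
    return effects


def top_and_bottom(effects, k):
    """Return the k most negative and the k most positive (token, logit) pairs."""
    ranked = sorted(effects.items(), key=lambda pair: pair[1])
    negative = ranked[:k]
    positive = ranked[::-1][:k]
    return negative, positive


# Toy values, made up for illustration (not the real model's weights).
vocab = ["fault", "HTTPS", "cup"]
unembedding = [[1.5, 0.0], [-0.5, 1.0], [1.0, 2.0]]
direction = [1.0, 0.0]

effects = logit_effects(direction, unembedding, vocab)
negative, positive = top_and_bottom(effects, k=1)
```

Under this reading, a possessive context such as "my" or "your" would raise the score of tokens like "fault" or "idea" and lower the score of tokens like "HTTPS", which is consistent with the explanations listed at the top.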
Activations Density 0.096%
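Activation density is usually read as the fraction of sampled tokens on which the feature fires. The sketch below assumes that definition and a firing threshold of zero; neither is confirmed by the source, and `activation_density` is a hypothetical helper.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: a feature counts as active when its activation is strictly
    greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Toy example: the feature fires on one of four tokens.
density = activation_density([0.0, 0.0, 1.2, 0.0])
```

By this definition, a density of 0.096% corresponds to the feature firing on roughly 96 out of every 100,000 tokens.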