INDEX
Explanations
statements expressing personal opinion or ownership
expressions of personal opinions or ownership
Negative Logits
alot            -0.71
elsewhere       -0.71
everywhere      -0.69
OTHER           -0.66
ALSO            -0.62
somew           -0.60
)))             -0.60
neighbouring    -0.59
))))            -0.58
somewhere       -0.57
Positive Logits
asuring         0.56
ãĥīãĥ©          0.53
rone            0.51
unci            0.50
ãĥŃ             0.49
ãĤ©             0.48
unciation       0.48
uj              0.48
houn            0.47
xtap            0.47
Activations Density 0.950%