INDEX
Explanations
comments or statements made by individuals
instances of the word "comment" or its variations indicating opinions or statements made by individuals
Negative Logits
nown        -0.93
Sinai       -0.85
turf        -0.74
ãĥ¼ãĥ«      -0.70
married     -0.68
lay         -0.68
adena       -0.68
riz         -0.67
Lansing     -0.67
fold        -0.65
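The entry ãĥ¼ãĥ« in the negative logits list looks like encoding debris, but it is consistent with how byte-level BPE vocabularies display non-ASCII tokens. The sketch below assumes (the page does not say so) that the model uses the GPT-2 style byte-to-character table; under that assumption the string maps back to UTF-8 bytes that decode to readable text. The helper names `byte_to_char_table` and `decode_token` are hypothetical and exist only for this illustration.

```python
def byte_to_char_table():
    """Rebuild the GPT-2 style byte -> printable character table.

    Assumption: the dashboard's tokenizer follows this convention.
    """
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # '¡' .. '¬'
        + list(range(0xAE, 0x100))  # '®' .. 'ÿ'
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is assigned a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


def decode_token(token):
    """Map a displayed vocabulary string back to the text it encodes."""
    char_to_byte = {ch: b for b, ch in byte_to_char_table().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    # Single tokens can hold partial UTF-8 sequences, so replace bad bytes.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("ãĥ¼ãĥ«"))
```

If the assumption holds, the token stands for the katakana sequence ール, while plain ASCII entries such as nown decode to themselves.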
Positive Logits
ariat       0.96
ature       0.86
ively       0.84
favorably   0.82
ault        0.76
ivity       0.76
atures      0.75
atively     0.74
ivating     0.74
ivated      0.73
Activations Density 0.025%