INDEX
Explanations
words related to providing evidence or confirmation
terms related to legal or substantive discussions
New Auto-Interp
Negative Logits
ged
-0.83
nesday
-0.82
bors
-0.80
enhagen
-0.78
sonian
-0.77
gerald
-0.76
fle
-0.74
keley
-0.73
lem
-0.72
gets
-0.72
POSITIVE LOGITS
substant
0.85
ively
0.77
igr
0.75
ŃĶ
0.75
ially
0.71
iations
0.71
iate
0.70
ĸļ
0.69
afort
0.68
ËĪ
0.66
Activations Density 0.023%