INDEX
Explanations
phrases related to uncertainty or questioning
statements of fact or assertions about various subjects
Negative Logits
Token         Logit
—-            -0.70
—"            -0.66
Chero         -0.63
Seym          -0.61
beforehand    -0.56
Afterwards    -0.56
-"            -0.55
traged        -0.55
summers       -0.54
||||          -0.53
Positive Logits
Token         Logit
rael          1.17
hereby        0.93
ometric       0.87
olation       0.85
Ĥİ            0.82
Ĥª            0.80
currently     0.78
rated         0.78
olate         0.78
othermal      0.75
Activations Density: 0.283%