INDEX
Explanations
statements or declarations related to various situations or topics
statements or official communications related to events or incidents
Negative Logits
theorem    -0.65
omics      -0.65
hiba       -0.65
skill      -0.64
EStream    -0.63
oleon      -0.63
otrop      -0.63
atron      -0.63
opian      -0.62
çļ         -0.59
Positive Logits
statement        1.13
Statement        1.03
clar             1.02
clarified        1.01
spokesperson     0.97
disav            0.95
clarification    0.94
refuted          0.94
apologised       0.93
emailed          0.92
Activations Density 0.500%