Explanations

Extremism/Nationalism

Negative Logits

Token              Logit
"-await"           -0.08
"Chance"           -0.08
" Gust"            -0.08
" истор"           -0.08
" refreshments"    -0.08
" verfü"           -0.07
" izango"          -0.07
" कह"              -0.07
" finer"           -0.07
" 제공"            -0.07

Positive Logits

Token              Logit
" excessive"       0.16
" exces"           0.14
" excessively"     0.14
" obsessive"       0.13
" obses"           0.13
" overly"          0.13
" чрез"            0.13
" fanatic"         0.12
" obsession"       0.12
" excesso"         0.11

Activations Density 0.048%