Negative Logits

 harmful       -0.09
түр            -0.09
 fined         -0.09
 સ્ત            -0.08
 fréquent      -0.08
 магаз         -0.08
Costo          -0.08
 voorkomen     -0.08
 Sympathy      -0.08
 штраф         -0.08

Positive Logits

 gemeinsam         0.13
 enthusiasm        0.12
 teamwork          0.12
 enthousiasme      0.12
 embarking         0.11
 collaboratively   0.11
 kickoff           0.11
 collaborating     0.11
 enthusiast        0.11
 కలిసి              0.11

Activations Density: 0.018%