INDEX
    Explanations

    instances of reported speech or references to statements made by individuals or entities

    Negative Logits
     OnInit               -0.73
     genie                -0.60
    HtmlAttribute         -0.58
     otomatig             -0.57
     CreateTagHelper      -0.56
     sensei               -0.54
     ostrich              -0.54
     reafon               -0.54
    Personendaten         -0.54
    ArrowToggle           -0.54
    Positive Logits
    новништво             0.69
    __':                  0.59
    Referanser            0.57
    heça                  0.55
    SpringBootTest        0.55
     journalistes         0.55
     ErrIntOverflow       0.55
    grape                 0.54
    بوابة                 0.53
     ब्रेकडाउन              0.53
    Activation Density: 0.578%

    No Known Activations