INDEX
    Explanations

    dialogue between two people

    Negative Logits
    tituzione    -0.49
    redhat       -0.48
    uff          -0.44
    histor       -0.43
     affari      -0.42
    am           -0.42
     hybrid      -0.42
    HEET         -0.42
     delito      -0.40
     podamos     -0.40
    Positive Logits
    ArrowToggle        0.94
     חיצוניים          0.87
    aarrggbb           0.79
    scrapy             0.78
    AxisAlignment      0.75
    featureID          0.74
     ProtoMessage      0.73
     <=",              0.73
    InjectAttribute    0.72
    Geplaatst          0.71
    Activation Density: 0.495%

    No Known Activations