INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mijne
    -0.49
     zelve
    -0.49
     çat
    -0.48
     verster
    -0.47
     trouw
    -0.47
     ouder
    -0.45
     trưng
    -0.44
    มาะ
    -0.44
     lijn
    -0.44
     geluk
    -0.43
    POSITIVE LOGITS
     video
    1.18
     Video
    1.16
    Video
    1.16
    video
    1.11
     videos
    1.06
     Videos
    1.05
     VIDEO
    1.03
    Videos
    0.93
    videos
    0.93
    VIDEO
    0.91
    Act Density 0.055%

    No Known Activations