INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     péri
    -0.64
     jadx
    -0.60
     assolu
    -0.54
     tiểu
    -0.54
     shooters
    -0.49
    jface
    -0.48
     unhas
    -0.47
     Reporters
    -0.47
    Filmographie
    -0.47
    enterOuterAlt
    -0.46
    POSITIVE LOGITS
    room
    0.91
    rooms
    0.77
     room
    0.73
     Roskov
    0.68
    ROOM
    0.67
     rooms
    0.66
     kasarigan
    0.65
    Искәрмәләр
    0.65
    Room
    0.64
    Rooms
    0.59
    Act Density 0.004%

    No Known Activations