INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mannit
    0.37
    ктором
    0.35
    aną
    0.34
    ばかり
    0.34
     fift
    0.34
     misfort
    0.34
    情報を
    0.34
     importanti
    0.34
     gacchati
    0.34
     afflicted
    0.34
    POSITIVE LOGITS
     bathrooms
    0.44
     levels
    0.43
     tiers
    0.41
     antennae
    0.41
    Levels
    0.41
     bedrooms
    0.40
    levels
    0.40
     cameras
    0.39
     cups
    0.39
     jets
    0.38
    Act Density 0.046%

    No Known Activations