INDEX
    Explanations

    words indicating social interactions or communication

    New Auto-Interp
    Negative Logits
    MigrationBuilder
    -1.35
    :✨
    -1.13
     OFDb
    -1.11
    Personensuche
    -1.04
    GEBURTSDATUM
    -1.04
    expandindo
    -1.03
    ########.
    -1.02
    providedIn
    -1.02
     ویکی‌پدی
    -0.98
     utafitiHapana
    -0.96
    POSITIVE LOGITS
    0.66
    ↵↵
    0.53
     (
    0.53
    .
    0.52
    ↵↵↵
    0.47
    0.47
        
    0.46
       
    0.45
    	
    0.44
     I
    0.44
    Act Density 0.171%

    No Known Activations