    Explanations

    mentions of social media platforms and their associated activities

    Negative Logits
    Token             Logit
    AddTagHelper      -0.51
    AISSEE            -0.48
    gever             -0.45
    basicConfig       -0.44
    mantec            -0.43
    RegressionTest    -0.43
    Sail              -0.42
     siff             -0.42
    机制                -0.42
    charest           -0.41
    Positive Logits
    Token             Logit
     ligiloj          0.45
    featureID         0.44
     dAtA             0.42
    ArgsConstructor   0.39
    RegistryLite      0.38
    itschrift         0.37
    הערות             0.37
    脚注の使い方            0.36
     يتيمه            0.36
     koordin          0.35
    Activation Density: 0.006%

    No Known Activations