    Explanations

    phrases related to accountability and social justice issues

    Comes before comparisons ("like" or similar)

    "just like" and similar comparisons

    Negative Logits
    Token                 Logit
    " <<<<<<<<<<<<<<"     -0.53
    " defaultstate"       -0.53
    "Бахар"               -0.52
    "ViewImports"         -0.52
    "ApplicationTests"    -0.51
    "diali"               -0.51
    "脚注の使い方"        -0.49
    "]');"                -0.49
    ")');"                -0.48
    "HtmlAttribute"       -0.48
    Positive Logits
    Token          Logit
    " like"        3.17
    " Like"        2.48
    "Like"         2.46
    "like"         2.27
    " LIKE"        2.21
    " unlike"      2.00
    " seperti"     1.99
    "LIKE"         1.92
    " zoals"       1.85
    "unlike"       1.83
    Activation Density: 0.945%

    No Known Activations