INDEX
    Explanations

    phrases expressing superlatives and comparisons

    Negative Logits
    Token      Logit
    "824"      -0.17
    " Tul"     -0.17
    "曲"       -0.16
    "ML"       -0.15
    "863"      -0.15
    "kara"     -0.15
    "究"       -0.15
    " no"      -0.15
    " Klo"     -0.14
    "ingers"   -0.14
    Positive Logits
    Token              Logit
    "hev"              0.16
    "روع"              0.15
    " ever"            0.15
    " nunca"           0.15
    "never"            0.14
    "ever"             0.14
    " ilg"             0.14
    " reife"           0.14
    " unprecedented"   0.14
    "-ever"            0.14
    Activation Density: 0.083%

    No Known Activations