INDEX
    Explanations

    phrases indicating quantity or comparison

    New Auto-Interp
    Negative Logits
     very
    -0.30
     Very
    -0.25
    very
    -0.22
    Very
    -0.22
    å¾Ī
    -0.21
     muito
    -0.21
     VERY
    -0.19
     quite
    -0.19
     molto
    -0.18
     more
    -0.18
    POSITIVE LOGITS
     tarde
    0.23
     importante
    0.18
    preci
    0.17
     vast
    0.16
     grande
    0.16
     κον
    0.16
     antig
    0.15
     alta
    0.15
    alto
    0.15
    _advanced
    0.15
    Act Density 0.014%

    No Known Activations