INDEX
    Explanations

    instances of the word "this" and its variations

    this followed by explanation

    New Auto-Interp
    Negative Logits
     IBOutlet
    -0.48
     soap
    -0.47
    hyrchwyd
    -0.47
    LookAnd
    -0.46
     parlour
    -0.45
     Rotating
    -0.45
     houſe
    -0.45
     freezer
    -0.45
    lrrrr
    -0.45
     haer
    -0.43
    POSITIVE LOGITS
    AccessorTable
    0.61
     spowod
    0.52
     resulting
    0.49
     means
    0.47
     caused
    0.45
     powod
    0.45
     enables
    0.44
     allows
    0.44
     continúas
    0.44
     wodurch
    0.43
    Act Density 0.130%

    No Known Activations