INDEX
    Explanations

    references to physical locations or positions in relation to people or objects

    New Auto-Interp
    Negative Logits
    ortex
    -0.17
    ancia
    -0.17
     éĤ
    -0.16
     GOODMAN
    -0.16
    ppo
    -0.16
    rah
    -0.15
    ancias
    -0.14
    ÑĸÑĶ
    -0.14
    ève
    -0.14
    enticate
    -0.14
    POSITIVE LOGITS
     cameras
    0.24
     camera
    0.22
    -camera
    0.22
     eyes
    0.20
     Eyes
    0.18
    eyes
    0.17
     Cameras
    0.17
    uria
    0.17
    Camera
    0.17
     Camera
    0.16
    Act Density 0.036%

    No Known Activations