INDEX
    Explanations

    The neuron activates on the word "dinosaur" and on related terms (e.g., dinosaurs, dinosauria, Triceratops, Stegosaurus, T-Rex).

    Negative Logits
    Token             Logit
    " streaming"      -0.07
    " bubble"         -0.06
    " tuples"         -0.06
    " placebo"        -0.06
    " proces"         -0.06
    "plets"           -0.06
    " καθώς"          -0.06
    " ProgressBar"    -0.06
    " coolant"        -0.06
    " repaint"        -0.06
    Positive Logits
    Token             Logit
    " dinosaur"       0.14
    " dinosaurs"      0.13
    "osaurs"          0.10
    "osaur"           0.08
    "inosaur"         0.08
    "SF"              0.08
    " Jurassic"       0.07
    "(?"              0.07
    " đổ"             0.07
    ".AlertDialog"    0.07
    Activation Density: 0.005%
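
    To make the density figure concrete, here is a minimal sketch of how such a number could be computed. It assumes, and the page does not confirm this, that density means the fraction of token positions on which the neuron's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the synthetic activation list are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of tokens on which the neuron fires at all.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic data (not from the dashboard): 5 active tokens out of 100,000.
acts = [0.0] * 100_000
for i in (10, 2_000, 30_000, 55_555, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints "0.005%"
```

    Under that assumption, a density of 0.005% corresponds to roughly 5 firing tokens per 100,000 tokens sampled.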

    No Known Activations