INDEX

Explanations

expressions involving variables

This neuron responds to cue words and phrases that structure math problem statements (e.g. “Let,” “where,” “assuming,” “What is,” “Express,” “Determine”), marking definitions, conditions, and question prompts.

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token            Logit
"amd"            -1.80
" harika"        -1.77
" ücretsiz"      -1.63
"܊"              -1.63
(blank)          -1.55
"cea"            -1.54
"hoge"           -1.53
" astonishing"   -1.53
" basit"         -1.52
"những"          -1.51

Positive Logits

Token            Logit
" because"       2.14
" when"          1.80
" Both"          1.73
" both"          1.71
" Generally"     1.69
" Normally"      1.69
" After"         1.65
" Before"        1.63
" Initially"     1.62
" When"          1.60

Activations Density 0.316%
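
To give a rough sense of scale for the density figure, the sketch below combines it with the prompt configuration listed above. It assumes, and the page does not confirm, that the density is the fraction of tokens on which the neuron is active and that it was measured over the same 24,576 prompts of 128 tokens each. The function and constant names are illustrative only.

```python
# Figures copied from the dashboard page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PCT = 0.316


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the prompt set."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_pct: float) -> int:
    """Approximate count of tokens with a nonzero activation.

    ASSUMPTION: density is a per-token fraction measured over this
    same prompt set; the source page does not state this explicitly.
    """
    return round(total * density_pct / 100)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PCT)
print(f"total tokens: {total:,}")
print(f"estimated active tokens: {active:,}")
```

Under those assumptions the neuron would be active on roughly one token in every 316, which fits a feature tied to a narrow set of cue words.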