INDEX
Explanations
instances where things are divided into distinct parts or groups
occurrences of the word "split" and related terms that denote division or separation
New Auto-Interp
Negative Logits
agara
-0.75
trak
-0.68
za
-0.68
ties
-0.65
cycles
-0.65
ECH
-0.64
jamin
-0.62
oken
-0.62
instein
-0.60
oha
-0.60
POSITIVE LOGITS
hairs
1.22
creen
0.99
evenly
0.93
between
0.91
opinion
0.74
Between
0.73
Personality
0.70
SPL
0.68
owship
0.68
ters
0.68
Activations Density 0.083%