INDEX
Explanations
references to a 'band' or similar grouping in various contexts
New Auto-Interp
Negative Logits
̷
-0.55
Fea
-0.51
জি
-0.50
suku
-0.48
Interviewer
-0.48
McGuire
-0.47
Subject
-0.47
Answered
-0.46
sog
-0.46
tasca
-0.45
POSITIVE LOGITS
bands
1.51
BAND
1.48
band
1.45
Band
1.45
Band
1.42
Bands
1.38
band
1.31
Bands
1.31
BAND
1.28
bands
1.25
Activations Density 0.181%