INDEX
Explanations
words related to filtering or selective processes
references to filtering mechanisms and related terms
New Auto-Interp
Negative Logits
ciating
-0.78
erald
-0.71
arrass
-0.66
ington
-0.64
olars
-0.63
ocamp
-0.62
Legends
-0.62
eanor
-0.62
lished
-0.61
aving
-0.61
POSITIVE LOGITS
filter
0.99
filters
0.95
filter
0.94
filtering
0.87
Filter
0.81
cutoff
0.80
operator
0.79
Filter
0.78
ters
0.78
filtered
0.76
Activations Density 0.044%