INDEX
Explanations
mentions of various representatives and meetings involving them
New Auto-Interp
Negative Logits
\\\\\\\\\\\\\\\\
-0.81
osure
-0.79
[|
-0.77
spir
-0.72
seed
-0.69
strap
-0.68
\\\\\\\\
-0.68
fall
-0.66
terson
-0.65
tered
-0.65
POSITIVE LOGITS
hips
1.39
Kislyak
1.02
onse
0.85
atives
0.78
hip
0.76
holder
0.76
ials
0.73
akes
0.73
atures
0.72
representing
0.70
Activations Density 0.011%