INDEX
Explanations
sentences that contain high activation values, indicating important or impactful statements
New Auto-Interp
Negative Logits
te
-0.45
Newswire
-0.45
坦
-0.43
лев
-0.43
óa
-0.42
wj
-0.42
vania
-0.42
GV
-0.41
PostMapping
-0.41
mav
-0.41
POSITIVE LOGITS
+#+#
1.06
tagHelperRunner
0.93
resourceCulture
0.90
مشين
0.86
writeFieldEnd
0.82
WebVitals
0.82
iconFacebook
0.81
(!__
0.79
SharedDtor
0.78
BoxFit
0.77
Activations Density 0.251%