INDEX
Explanations
proper nouns and names associated with various entities or characters
Tokens after capitalized words
Holiday Mansion, junar, KTLO, ao7b, Darroti
New Auto-Interp
Negative Logits
overall
-0.44
/*:
-0.41
დი
-0.40
actual
-0.37
principalTable
-0.37
ever
-0.36
Pain
-0.36
VARD
-0.35
anol
-0.35
Obrador
-0.35
POSITIVE LOGITS
الدراسه
0.61
Viited
0.59
帖最后由
0.56
مرئيه
0.54
ivelany
0.52
colgante
0.52
nonUne
0.52
0.51
يتيمه
0.51
ujednoznacz
0.49
Activations Density 0.577%