@Behnam provided the necessary info in https://github.com/w3c/csswg-drafts/issues/698#issuecomment-297472304 to define shaping behavior, so we should spec it.
In https://github.com/w3c/csswg-drafts/issues/698#issuecomment-444470317 @jfkthame wrote:
Sure, i understand that, but the example you give is not a particularly useful thing to do. I suspect that much of the time one will apply first-letter in order to change the formatting (see the image at #698 (comment)).
We have https://drafts.csswg.org/css-inline-3/#initial-letter-shaping saying that
When initial-letters is not normal, shaping should still occur across an inline initial letter boxās boundaries. ... For example, if the first letter of the Farsi word āŁ¾Ų³ā were styled with initial-letters: 2 1, both letters would be styled in their joined forms, with initial-form āļā as the initial letter, followed by the normally-styled final-form āļŗ²ā. Note that the two letters might not always graphically connect, even when shaped in their joining forms. (my emphasis)
But we have https://drafts.csswg.org/css-text/#boundary-shaping saying that
Text shaping must be broken at inline box boundaries when any of the following are true for any box whose boundary separates the two typographic character units: Any of margin/border/padding separating the two typographic character units in the inline axis is non-zero. ...
The two things seem to be contradictory, or at least to warrant some additional explanation.
If the first letter is styled with a different font-size, font-family, etc it's entirely possible it won't actually connect to the next letter even when the appropriate contextual forms are used; I think that's what the emphasized sentence above is pointing out.
Adding margin or border around the letter, OTOH, would be a cause for not shaping between the initial letter and following text.
Raised caps are an interesting edge case, where I guess it's reasonable to maintain joining as the baseline isn't changed and no horizontal separation is being introduced (though it seems an unlikely thing for someone to really want to do in such a script).
The few examples I've seen of traditional practice (see https://github.com/w3c/csswg-drafts/issues/698#issuecomment-297472304 and the linked discussion) don't appear to support shaping across more general drop-cap-like formatting: the large dropped, boxed initial in https://upload.wikimedia.org/wikipedia/commons/6/6a/Hafezeshamlu02.jpg, for example, is not shaped in an initial form.
The few examples I've seen of traditional practice (see #698 (comment) and the linked discussion) don't appear to support shaping across more general drop-cap-like formatting: the large dropped, boxed initial in https://upload.wikimedia.org/wikipedia/commons/6/6a/Hafezeshamlu02.jpg, for example, is not shaped in an initial form.
Do we have a source which we believe is reliable that _does_ show enlarged initial letters using joining forms? @behnam, @shervinafshar, @khaledhosny, @sahafshar, @ntounsi, @mostafa, any ideas?
@r12a, this topic has been bouncing around for a bit now. My personal take is that since this is a novelty of a typographical practice in Arabic script text, there is no reliable source. However, the research shows that (a) most cases of drop-caps in Arabic script texts are not using joining forms; (b) through my research on the matter, I believe that there is also some precedence for the drop-caps in Arabic script text to _use of joining forms_. This requires further investigation, but I reproduce the case that I'm referring to as a rare precedence.
What follows is from the volume _Compendium of Latin Translations of Persian Astronomical Tables_, selected portions of _ZÄ«j-i SultÄnÄ«_ translated into Latin and published by Oxford University Press in 1655.


Further pointers:
ā my thread on Persian Computing
ā Liam Quin's blogpost has a section with his findings, points to the thread above
Iāve only ever seen examples of this form as non-connected (from what I recall itās a very rare practice in general, and I canāt think of many modern day uses of it. Attached example is from Ketabāe Hafte (Book of the Week) published in Tehran, 1962.
I agree with Shervin that itās unlikely any reliable resource can be found on this, as itās a purely typographic convention and not somethingāI reckonā you would ever see in calligraphy or manuscripts.
--
Sahar Afshar
PhD Candidate | Research Assistant
Centre for Printing History & Culture
Faculty of Art, Design and Media, Birmingham City University
Parkside Building, 5 Cardigan Street, Birmingham B4 7BD
www.cphc.org.uk @sahafshar
On 12 Dec 2018, at 16:23, Shervin Afshar notifications@github.com wrote:
@r12a https://github.com/r12a, this topic has been bouncing around for a bit now. My personal take is that since this is a novelty of a typographical practice in Arabic script text, there is no reliable source. However, the research shows that (a) most cases of drop-caps in Arabic script texts are not using joining forms; (b) through my research on the matter, I believe that there is also some precedence for the drop-caps in Arabic script text to use of joining forms. This requires further investigation, but I reproduce the case that I'm referring to as a rare precedence.
What follows is from the volume Compendium of Latin Translations of Persian Astronomical Tables https://www.wdl.org/en/item/10676/#institution=qatar-national-library, selected portions of ZÄ«j-i SultÄnÄ« https://en.wikipedia.org/wiki/Zij-i_Sultani translated into Latin and published by Oxford University Press in 1655.
https://user-images.githubusercontent.com/875962/49882078-ad7fba00-fde4-11e8-8487-b82f50932f65.png
https://user-images.githubusercontent.com/875962/49881977-6d203c00-fde4-11e8-88d2-796168ac9007.png
Further pointers:
ā my thread on Persian Computing https://groups.google.com/forum/#!topic/persian-computing/W-Upl6DAEcg
ā Liam Quin's blogpost https://barefootliam.blogspot.com/2014/05/drop-caps-other-writing-systems-other.html has a section with his findings, points to the thread aboveā
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub https://github.com/w3c/csswg-drafts/issues/2399#issuecomment-446648774, or mute the thread https://github.com/notifications/unsubscribe-auth/AQr44x9BXmzppcREbxz11eYmqiN4iYgVks5u4S11gaJpZM4Sb7jx.
I wonder whether the answer here is to assume that by default arabic versals are _not_ joining forms, but allow authors to turn them into joining forms using ZWJ if desired??
In that case, would we need a special rule to say that ZWJ is kept with the preceding letter when using ::first-letter selection?
(Btw, @sahafshar your image didn't seem to make it into the thread.)
Fwiw (perhaps not much) if you open this test in Safari (which is the only major browser i know of that supports initial-letter behaviour) you'll see that when selecting the first letter of a word where i had inserted ZWJ after the initial letter, ::first-letter automatically picked up the ZWJ and rendered the enlarged initial as a joining form.
If there is no ZWJ (see this test) then the initial letter is unjoined, but the 2nd letter in the word is joined.
@faceless2 a nice drop cap testcase for you
I'm inclined to spec the behavior @r12a describes in https://github.com/w3c/csswg-drafts/issues/2399#issuecomment-577520633 Happy to do something different if there's some clear consensus as to what's correct, though. :)
If there is no ZWJ (see this test) then the initial letter is unjoined, but the 2nd letter in the word is joined.
@r12a When you say "but the 2nd letter in the word is joined", do you mean it takes a right-joining (medial or final) form, as if it were joined to the initial letter? Or just that it is joined to what follows it?
(As far as I can see when testing here, it takes an initial form; i.e. it does not shape as though joined to the initial letter.)
Got an iOS screenshot I got from my friend... so it does look like Safari is breaking the connection on both sides.
That said, using the correct connecting form between the second part of the word and the initial letter seems to me would be more readable than actually breaking this connection. Similar concept to how Latin drop-caps kern in the rest of the first word into the drop-cap, to help maintain a clearer connection between the first letter and the rest of its word.
I also understand that using isolated form for a drop-cap just looks a lot better; that's probably why it's more common per @shervinafshar and @sahafsharās comments.
So basically we have three options here:
I think the third option is actually the best one. Curious to hear what native users of the writing system think.
Here's a mockup of all three options, for reference:

I agree @fantasai, the third option makes most sense to me. Sorry @r12a you're right, I forgot the reference picture in my original response on the thread. I've attached it here. As you can see, it also follows the same logic as the split model.

Tentatively updated the editor's draft with text for this behavior given @sahafsharās confirmation. I'll wait on more feedback before closing the issue, though.
This change isn't script-specific? Seems reasonable. Given how hard it was finding examples for arabic etc. I'm pretty sure we'd be basing a decision on a sample size of zero for N'Ko.
@faceless2 I think in the absence of conclusive information to the contrary, it should apply to all scripts. :) The principles that make it the most sensible option (see https://github.com/w3c/csswg-drafts/issues/2399#issuecomment-635630662) are generally-applicable.
Most helpful comment
I agree @fantasai, the third option makes most sense to me. Sorry @r12a you're right, I forgot the reference picture in my original response on the thread. I've attached it here. As you can see, it also follows the same logic as the split model.
