The WHATWG Blog

Staged proposals at the WHATWG
April 28th, 2025
by Domenic Denicola
The WHATWG's living standards incorporate new features on an ongoing basis. The default process is to propose an idea on the relevant standard's issue tracker, hash out the details and gather implementer interest, and then work together with the editors to land a pull request.
However, we've found that for larger features, or for community members less experienced with standards contributions, sometimes this lightweight process does not give enough guidance. It can be tricky to attract attention to one's proposal, or to get the clear signals from implementers that are necessary to land changes.
Inspired by other working groups, most notably the TC39 group that defines the JavaScript language, the WHATWG community has established a new, optional Stages process for additions to WHATWG standards. Features can proceed from stage 0, where they just consist of a problem description, through to stage 4, when the feature has fully landed in the relevant living standard. Each stage requires increasing levels of community consensus, including from implementers and from the standard's editors. This gives the proposer, as well as the community, a better idea of a feature's progress towards being considered an accepted part of the web platform.
We've now had the Stages process for over a year. As of today, one proposal, the node.moveBefore() atomic move operation, has advanced to stage 4; one, for the customizable

hidden=a%0D%0Ab%0D%0Ac%0D%0Ad
But although it might seem simple on the surface, newline normalization in form submissions is a topic that runs deeper than I thought, with bugs in the spec and differences across browsers. This post goes through what the spec used to do, what browsers (used to) implement, and how we went about fixing it.
First, some background on form submission
The data to submit from a form is modeled as an entry list, with entries being pairs of names (strings) and values (either strings or File objects). This is a list rather than a map because a form can have multiple values for each name, which is how

and

Here is the resulting multipart/form-data payload in Chrome and Safari (newlines are always CRLF):
------WebKitFormBoundaryjlUA0jn3NUYxIh2A
Content-Disposition: form-data; name="hidden a%0D%0Ab"

------WebKitFormBoundaryjlUA0jn3NUYxIh2A
Content-Disposition: form-data; name="file a%0D%0Ab"; filename="a%0Db"
Content-Type: application/octet-stream

------WebKitFormBoundaryjlUA0jn3NUYxIh2A--
And this is in Firefox 88 (the current stable version as of this writing):
-----------------------------26303884030012461673680556885
Content-Disposition: form-data; name="hidden a b"

-----------------------------26303884030012461673680556885
Content-Disposition: form-data; name="file a b"; filename="a b"
Content-Type: application/octet-stream

-----------------------------26303884030012461673680556885--
As you can see, Firefox substitutes a space for any newlines (CR, LF or CRLF) in the multipart/form-data encoding of entry names and filenames, rather than percent-encoding them as Chrome and Safari do. This behavior was made illegal in the spec in pull request #6282, but it couldn't be fixed in Firefox until the spec decided on a normalization behavior. In the case of values, Firefox normalizes to CRLF as the other browsers do.
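To make the two behaviors concrete, here is a minimal sketch of how a name or filename could end up in a Content-Disposition header under each approach. The function names are hypothetical and not taken from any browser's source, and the percent-encoding covers only CR, LF and the double quote.

```javascript
// Hypothetical helpers; the names are for illustration only.

// Chrome/Safari-style: percent-encode CR, LF and double quotes.
function percentEncodeHeaderValue(value) {
  return value
    .replace(/\r/g, "%0D")
    .replace(/\n/g, "%0A")
    .replace(/"/g, "%22");
}

// Firefox 88-style: replace each newline (CRLF, CR or LF) with one space.
function replaceNewlinesWithSpace(value) {
  return value.replace(/\r\n|\r|\n/g, " ");
}

// A name that has already been normalized to CRLF.
const percentEncodedName = percentEncodeHeaderValue("hidden a\r\nb");
const spaceSeparatedName = replaceNewlinesWithSpace("hidden a\r\nb");

console.log(percentEncodedName);
console.log(spaceSeparatedName);
```

The first result has the shape seen in the Chrome and Safari payload, the second the shape seen in the Firefox 88 payload.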
As for Chrome and Safari, here we see that newlines in entry names and string values are normalized to CRLF, but filenames are not normalized. From the entry list construction algorithm as described above, this makes sense because entry values are only normalized to CRLF when they are strings: files are unchanged, and so are their filenames.
Except that, if you change the form's enctype in the above example to application/x-www-form-urlencoded, you get this in every browser:
hidden+a%0D%0Ab=a%0D%0Ab&file+a%0D%0Ab=a%0D%0Ab
Since multipart/form-data is the only enctype that allows file uploads, other enctypes submit each file's filename in place of the file. But here it seems like every browser is CRLF-normalizing the filenames, even though in the spec that substitution happens long after constructing the entry list.
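The payload above can be reproduced outside a browser. The sketch below assumes a simplified entry list of [name, value] pairs in which the file has already been replaced by its filename, and that the original names and values contained a bare CR. normalizeToCRLF is a hypothetical helper, and URLSearchParams performs the application/x-www-form-urlencoded serialization.

```javascript
// Hypothetical helper: turn every newline (CRLF, lone CR, lone LF) into CRLF.
function normalizeToCRLF(value) {
  return value.replace(/\r\n|\r|\n/g, "\r\n");
}

// Simplified entry list: the file entry is represented by its filename,
// as happens for enctypes that cannot carry file uploads.
const entries = [
  ["hidden a\rb", "a\rb"],
  ["file a\rb", "a\rb"],
];

const normalized = entries.map(([name, value]) => [
  normalizeToCRLF(name),
  normalizeToCRLF(value),
]);

const payload = new URLSearchParams(normalized).toString();
console.log(payload);
```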
Normalizations with FormData and fetch
The FormData class started out as a way to send multipart/form-data form payloads through the XMLHttpRequest and fetch APIs without having to generate that payload in JavaScript. As such, FormData instances are basically a JS-accessible wrapper over an entry list.
So let's try the same with FormData:
const formData = new FormData();
formData.append("hidden a\rb", "a\rb");
formData.append("file a\rb", new File([], "a\rb"));

// FormData objects in fetch will always be serialized as multipart/form-data.
await fetch("./post", { method: "POST", body: formData });
Safari sends the same form payload as above, with names and values normalized to CRLF. Firefox 88 also matches its earlier payload, with values normalized to CRLF and with newlines in names and filenames replaced by spaces. But Chrome keeps names, filenames and values unnormalized (here the character stands for CR):
------WebKitFormBoundarySMGkMfD8mVOnmGDP
Content-Disposition: form-data; name="hidden a%0Db"

------WebKitFormBoundarySMGkMfD8mVOnmGDP
Content-Disposition: form-data; name="file a%0Db"; filename="a%0Db"
Content-Type: application/octet-stream

------WebKitFormBoundarySMGkMfD8mVOnmGDP--
Since FormData is just a wrapper over an entry list, and fetch simply calls the multipart/form-data encoding algorithm, no normalizations should take place. So it looks like Chrome was following the spec here, while Firefox and Safari were apparently doing some newline normalization (for Firefox, on string values only) at the time of serializing as multipart/form-data.
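You can check that FormData itself does not touch newlines by reading entries back. This small sketch only exercises the FormData API, so it says nothing about what a given browser does later when it serializes the payload.

```javascript
const fd = new FormData();
fd.append("hidden a\rb", "a\rb");

// Names and string values come back exactly as they were appended.
const [[name, value]] = [...fd.entries()];

console.log(JSON.stringify(name));  // "hidden a\rb"
console.log(JSON.stringify(value)); // "a\rb"
```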
With FormData you can also investigate what the "construct the entry list" algorithm does, since if you pass a

So it seems like Firefox and Safari are not normalizing as they construct the entry list, and instead normalize names and values at the time that they encode the form into an enctype. In particular, since the application/x-www-form-urlencoded and text/plain enctypes don't allow file uploads, file entry values are substituted with their filenames before the normalization. Entry lists that aren't created from the "construct an entry list" algorithm get normalized all the same.
Chrome instead follows the specification (as it used to be) in normalizing in "construct an entry list" and not normalizing later, even for entry lists created through other means. But that doesn't explain why filenames in the application/x-www-form-urlencoded and text/plain enctypes are normalized. Does Chrome also have an additional normalization layer?
Investigating late normalization with the formdata event
It would be great to investigate in more detail what Chrome and other browsers do after constructing the entry list. Since the entry list construction already normalizes entries, any further normalizations that might happen further down the line are obscured in the common case.
In the case of multipart/form-data, we can test this because using a FormData object with fetch doesn't invoke "construct an entry list", and so we can see what happens to unnormalized entries. For other enctypes there is no way to create an entry list that doesn't go through "construct an entry list", but as it turns out, the "construct an entry list" algorithm itself offers two ways to end up with unnormalized entries: form-associated custom elements (only implemented in Chrome so far) and the formdata event (implemented in Chrome and Firefox). Here we'll only be covering the latter, since their results are equivalent.
One thing I skipped when I covered the "construct an entry list" algorithm above is that, at the end of the algorithm, after all entries corresponding to controls have been added to the entry list, a formdata event is fired on the relevant <form> element. This event has a formData attribute which allows you not only to inspect the entry list at that point, but to modify it.
<form id="form" action="./post" method="post" enctype="application/x-www-form-urlencoded"></form>
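The script that accompanied this form is not preserved here. As an assumption about what such a test looks like, the sketch below registers a formdata listener that appends one string entry and one file entry containing a bare CR, so they bypass the normalization applied to control values. The entry names mirror the payload that follows, and addUnnormalizedEntries is a hypothetical helper name.

```javascript
// Hypothetical helper: appends entries whose names and values contain a bare CR.
function addUnnormalizedEntries(formData) {
  formData.append("string a\rb", "a\rb");
  // A Blob plus an explicit filename becomes a File entry named "a\rb".
  formData.append("file a\rb", new Blob([]), "a\rb");
}

// In a browser, hook the helper up to the form and submit it.
if (typeof document !== "undefined") {
  const form = document.getElementById("form");
  form.addEventListener("formdata", (event) => {
    addUnnormalizedEntries(event.formData);
  });
  form.submit();
}
```

Keeping the helper separate from the listener means the same entries can also be appended to a standalone FormData object, for example to send them with fetch.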

For both Chrome and Firefox (not Safari because it doesn't support the formdata event), trying this with the application/x-www-form-urlencoded enctype gets you a normalized result:
string+a%0D%0Ab=a%0D%0Ab&file+a%0D%0Ab=a%0D%0Ab
Firefox shows the same normalizations for the text/plain enctype; Chrome instead normalizes only filenames, not names and values. And with multipart/form-data we get the same result as with fetch and FormData above: Chrome doesn't normalize anything, while Firefox normalizes string values (with newlines in names and filenames being replaced with spaces).
So in short:
For application/x-www-form-urlencoded, all browsers perform an additional newline normalization at the moment of serializing the form payload, whether or not they normalize when constructing the entry list. Note that newline normalizations are idempotent, so normalizing an already normalized string doesn't change it.
For text/plain, Firefox and Safari seem to act just like for application/x-www-form-urlencoded. Chrome instead only normalizes filenames, probably at the same time as files are being substituted with their filenames.
For multipart/form-data, Chrome doesn't normalize anything. Safari instead normalizes names and string values, but not filenames. Firefox does the same as Safari for values, but replaces any newlines in names and filenames with a space.
Remember that these differences across browsers don't really affect the encoding of normal forms; they only matter if you're using FormData with fetch, the formdata event, or form-associated custom elements.
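The idempotence noted in the summary is what makes an extra, late normalization harmless for entries that were already normalized early. A minimal check, using a hypothetical normalizeToCRLF helper:

```javascript
// Hypothetical helper: turn every newline (CRLF, lone CR, lone LF) into CRLF.
function normalizeToCRLF(value) {
  return value.replace(/\r\n|\r|\n/g, "\r\n");
}

const once = normalizeToCRLF("a\rb\nc\r\nd");
const twice = normalizeToCRLF(once);

console.log(JSON.stringify(once)); // "a\r\nb\r\nc\r\nd"
console.log(once === twice);       // true
```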
Fixing the spec
So which behavior do we choose? Firefox replacing newlines in multipart/form-data names and filenames with a space is illegal as per PR #6282, but anything else is fair game.
For text/plain, we have Firefox and Safari behaving in the same way, and Chrome disagreeing. Since text/plain cannot represent inputs unambiguously, is little used in any case, and you would need either form-associated custom elements or the formdata event to see a difference, it seems extremely unlikely that there is web content that depends on either behavior. So it makes more sense to treat text/plain just like application/x-www-form-urlencoded and normalize names, filenames and values.
For multipart/form-data, there is the added compatibility risk that you can observe this case by using FormData and fetch, so it's more likely to cause webcompat issues, no matter whether we went with Safari's or Chrome's behavior. In the end we chose to go with Safari's, in order to be consistent in normalizing for all enctypes, although multipart/form-data has to differ in not normalizing filenames, of course.
So in pull request #6287 we fixed this by:
Adding a new algorithm that runs before the application/x-www-form-urlencoded and text/plain serializers. This algorithm first extracts a filename from the entry value, if it's a file, and then CRLF-normalizes both the name and value.
Changing the multipart/form-data encoding algorithm to have a first step that CRLF-normalizes names and string values, leaving file values intact.
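The two changes can be sketched as follows. This is an illustration of the behavior described above rather than the spec's actual algorithm text: the function names are hypothetical, and a plain object with a name property stands in for a File so that the sketch runs outside a browser.

```javascript
// Hypothetical helper: turn every newline (CRLF, lone CR, lone LF) into CRLF.
function normalizeToCRLF(value) {
  return value.replace(/\r\n|\r|\n/g, "\r\n");
}

// Stand-in for a File value: any object with a string `name` (an assumption).
function isFileLike(value) {
  return typeof value === "object" && value !== null && typeof value.name === "string";
}

// For application/x-www-form-urlencoded and text/plain:
// replace files with their filenames, then normalize names and values.
function toNameValuePairs(entryList) {
  return entryList.map(([name, value]) => {
    const stringValue = isFileLike(value) ? value.name : value;
    return [normalizeToCRLF(name), normalizeToCRLF(stringValue)];
  });
}

// For multipart/form-data: normalize names and string values,
// leaving file values (and therefore filenames) untouched.
function normalizeForMultipart(entryList) {
  return entryList.map(([name, value]) => [
    normalizeToCRLF(name),
    isFileLike(value) ? value : normalizeToCRLF(value),
  ]);
}

const entryList = [
  ["hidden a\rb", "a\rb"],
  ["file a\rb", { name: "a\rb" }],
];

console.log(toNameValuePairs(entryList));
console.log(normalizeForMultipart(entryList));
```

In the first result the filename is normalized because it has become an ordinary string value; in the second it keeps its bare CR.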
Do we need the early normalization though?
At the time that we decided the above changes to the spec, I still thought that the spec's (and Chrome's) behavior of normalizing in "construct the entry list" was correct. But later on I realized that, once we have the late normalizations mentioned above, the early normalization in "construct the entry list" doesn't matter for form submission, since the late normalizations do everything the early one can do and more. The only way you could observe whether that early normalization is present or not is through the FormData constructor. So it would make sense to remove that early normalization and standardize on Firefox and Safari's behavior here, as we did in pull request #6624.
One kink remained, though: <textarea> elements support the wrap="hard" attribute, which adds linebreaks into the submitted value corresponding to how the text is linewrapped in the UI. In the spec, this is done through the "textarea wrapping transformation", which takes the textarea's "raw value", normalizes it to CRLF, and when wrap="hard", adds CRLF newlines to wrap the contents. But if you test this on Safari, all newlines (both normalized and added) are LF. Firefox currently doesn't implement wrap="hard", but it does normalize newlines to LF.
So should this be changed? I thought it was better to align on Firefox's and Safari's behavior, especially since this could simplify the mess that is the difference between "raw value", "API value" and "value" for <textarea> in the spec. Chrome disagreed at first, but as it turns out, Chrome's implementation of the textarea wrapping transformation normalizes to LF, and it's only the normalization in "construct an entry list" that normalizes those newlines to CRLF. So Chrome would align with Firefox and Safari in that area by just removing the early normalization.
Pull request #6697 fixes this issue with <textarea> in the spec, and the follow-up issue #6662 will take care of simplifying the textarea wrapping transformation in the spec, as well as ensure that it matches implementations.
Implementation fixes
After those three pull requests, the current spec mandates that no normalization happen in "construct the entry list", and that the entry lists to be encoded with some enctype go through a transformation that depends on the enctype:
For application/x-www-form-urlencoded and text/plain, this transformation first replaces file values with their filenames, and then CRLF-normalizes names and values.
For multipart/form-data, this transformation CRLF-normalizes names and string values, but not filenames.
Safari implements the current spec behavior, and so doesn't need any fixes as a result of these spec changes. However, when working on them I noticed that Safari had a preexisting bug in that it wasn't converting entry names and string values into scalar value strings in the "construct an entry list" algorithm, which could lead to FormData objects containing DOMString values (that is, strings that may hold lone surrogates) despite the WebIDL declaration. What's more, those lone surrogates would remain there until the time of serializing the form, and might show up in the form payload as WTF-8 surrogate byte sequences. This is bug 225299.
Firefox acts much like Safari, except in that it escapes newlines in multipart/form-data names and filenames as spaces rather than CRLF-normalizing (in the case of names) and then percent-encoding. With the normalization now defined in the spec, this could be changed. This is bug 1686765. I also found the same bug as Safari with not converting strings into scalar value strings, except in this case the normalization did take place when encoding (bug 1709066). Both issues are now fixed in Firefox Nightly and will ship in Firefox 90.
Finally, Chrome would need to remove the normalization it performs in "construct an entry list", update the normalization it performs on the text/plain enctype to cover not only filenames but also names and values, and add an additional normalization for multipart/form-data names and values. This is covered in issue 1167095. Since Chrome is the browser that needs the most changes as a result of these spec changes, and they are understandably concerned with backwards compatibility, these fixes are expected to ship behind a flag so they can be quickly rolled back in case of trouble.
Conclusion
Although form submission is not necessarily seen as an easy topic, newline normalization in that context might not seem like such a big deal from the outside. But in a platform like the web, where spec and implementation decisions can easily snowball into compatibility concerns, even things that might look simple might have a history that makes standardizing on one behavior hard.
Posted in Browsers, Forms, WHATWG