Uzbl


FS#49 - weird symbols on some pages / no support for (gzip?) compressed pages

Attached to Project: Uzbl
Opened by Anonymous Submitter - 2009-06-25 04:35:26 PM
Last edited by Anonymous Submitter - 2010-04-22 08:37:13 PM
Task Type Bug Report
Category uzbl-core
Status Closed
Assigned To No-one
Operating System All
Severity High
Priority Normal
Reported Version Development
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

When looking up the word 'inquisitive' in Wiktionary, the browser displays a page full of symbols: http://en.wiktionary.org/wiki/inquisitive

Closed by  Anonymous Submitter
2010-04-22 08:37:13 PM
Reason for closing:  Works for me
Comment by Dieter Plaetinck (Dieter_be) - 2009-06-25 05:48:55 PM
Right.
My guess is this is because we do not handle compressed pages.
See:
dieter@dieter-ws-a7n8x-arch ~ curl -I http://en.wiktionary.org/wiki/inquisitive
HTTP/1.0 200 OK
Date: Thu, 25 Jun 2009 00:49:48 GMT
Server: Apache
X-Powered-By: PHP/5.2.4-2ubuntu5wm1
Cache-Control: private, s-maxage=0, max-age=0, must-revalidate
Content-Language: en
Vary: Accept-Encoding,Cookie
X-Vary-Options: Accept-Encoding;list-contains=gzip,Cookie;string-contains=enwiktionaryToken;string-contains=enwiktionaryLoggedOut;string-contains=enwiktionary_session;string-contains=centralauth_Token;string-contains=centralauth_Session;string-contains=centralauth_LoggedOut
Last-Modified: Mon, 18 May 2009 14:25:55 GMT
Content-Encoding: gzip
Content-Length: 5496
Content-Type: text/html; charset=utf-8
X-Cache: MISS from sq38.wikimedia.org
X-Cache-Lookup: MISS from sq38.wikimedia.org:3128
Age: 60805
X-Cache: HIT from knsq2.knams.wikimedia.org
X-Cache-Lookup: HIT from knsq2.knams.wikimedia.org:3128
X-Cache: MISS from knsq30.knams.wikimedia.org
X-Cache-Lookup: MISS from knsq30.knams.wikimedia.org:80
Via: 1.1 sq38.wikimedia.org:3128 (squid/2.7.STABLE6), 1.0 knsq2.knams.wikimedia.org:3128 (squid/2.7.STABLE6), 1.0 knsq30.knams.wikimedia.org:80 (squid/2.7.STABLE6)
Connection: close

The response uses gzip compression (note the Content-Encoding: gzip header).
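To illustrate why an undecoded gzip body shows up as a page full of symbols, here is a minimal Python sketch. It is not uzbl or libsoup code; the function name decode_body and the sample page are made up for illustration, and the header lookup is case-sensitive to keep the example short.

```python
import gzip


def decode_body(headers, body):
    """Return the response body as text, undoing gzip if the headers declare it.

    `headers` is a plain dict here (hypothetical; a real client would use a
    case-insensitive header map).
    """
    encoding = headers.get("Content-Encoding", "identity").strip().lower()
    if encoding in ("gzip", "x-gzip"):
        body = gzip.decompress(body)
    elif encoding != "identity":
        raise ValueError("unsupported Content-Encoding: " + encoding)
    return body.decode("utf-8")


# Simulate a server that gzips its response, as in the curl output above.
page = "<html><body>inquisitive</body></html>"
headers = {
    "Content-Encoding": "gzip",
    "Content-Type": "text/html; charset=utf-8",
}
wire_body = gzip.compress(page.encode("utf-8"))

# A client that ignores Content-Encoding treats the compressed bytes as text.
garbled = wire_body.decode("latin-1")

# A client that honours Content-Encoding recovers the original markup.
decoded = decode_body(headers, wire_body)
```

Here garbled is the compressed byte stream interpreted as characters, which is roughly what a browser renders when it skips the decompression step, while decoded matches the original page.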
Comment by Brendan Taylor (bct) - 2009-06-27 01:38:35 AM
It's definitely a problem with Content-Encoding: when I hardcode "Accept-Encoding: gzip" I see the same symptoms.

WebKit sends Accept-Encoding: identity, so servers shouldn't be gzipping their responses. A WebCore/ChangeLog says:

2009-05-28

Simplify the Accept-Encoding header we are sending out, for it
seems some servers do not enjoy parsing the full, explicit
version.

So maybe WebKit newer than 1.1.8 will fix your problem?

For a better fix (because gzip is nice), there's a libsoup bug about supporting Content-Encoding: http://bugzilla.gnome.org/show_bug.cgi?id=522772
Comment by 3288778 (g8723g82) - 2009-07-01 09:23:15 AM
No, the new WebKit won't fix the problem, because some webmasters just assume all web browsers accept gzip and send it to everything without reading the Accept-Encoding header. Another example: http://circuitdesignonline.com/
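For comparison, a well-behaved server would inspect the request's Accept-Encoding value before compressing. The following Python sketch shows one way such a check could look. The function name accepts_gzip is hypothetical, and the parsing is simplified (it assumes the header is present and handles only the coding names and q-values).

```python
def accepts_gzip(accept_encoding):
    """Return True if an Accept-Encoding header value permits a gzip response."""
    gzip_q = None  # q-value given explicitly for gzip, if any
    star_q = None  # q-value given for the "*" wildcard, if any
    for item in accept_encoding.split(","):
        parts = item.strip().split(";")
        coding = parts[0].strip().lower()
        q = 1.0  # a coding listed without a q parameter has weight 1
        for param in parts[1:]:
            name, _, value = param.strip().partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed weight as "not acceptable"
        if coding in ("gzip", "x-gzip"):
            gzip_q = q
        elif coding == "*":
            star_q = q
    if gzip_q is not None:
        return gzip_q > 0
    if star_q is not None:
        return star_q > 0
    return False


# WebKit at the time sent "identity", so a correct server must not gzip.
webkit_ok = accepts_gzip("identity")
typical_browser_ok = accepts_gzip("gzip, deflate")
explicitly_refused = accepts_gzip("gzip;q=0, identity")
```

With these inputs, webkit_ok and explicitly_refused are False and typical_browser_ok is True. The sites mentioned in this report behave as if the check always returned True.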
Comment by Brendan Taylor (bct) - 2009-07-04 02:13:57 AM
Ugh, circuitdesignonline assumes that you can do gzip AND that you can do application/xhtml+xml. Even if you explicitly state otherwise. That's pretty terrible.
Comment by Semen Maryasin (MarSoft) - 2009-07-08 10:45:40 AM
Hm, the 'inquisitive' page opens fine for me... (I'm using uzbl 9a32a5f8c8d18d143cf8b4a89531c866679026c7, libsoup 2.26.2, webkit 1.1.10)
And circuitdesignonline.com says:
___
This page contains the following errors:
error on line 1 at column 1: Encoding error
Below is a rendering of the page up to the first error.
---
(but with nothing below)
Comment by Sergey Alirzaev (L29Ah) - 2010-02-28 03:08:59 PM
According to https://bugzilla.gnome.org/show_bug.cgi?id=522772, gzip support was included in libsoup-2.28.2. How do I utilize it?
Comment by Anonymous Submitter - 2010-04-22 08:37:02 PM
Loading the largest page on Wikipedia works fine for me using the latest WebKit version: http://en.wikipedia.org/wiki/Line_of_succession_to_the_British_throne