Send Bristol mailing list submissions to
bristol@mailman.lug.org.uk
To subscribe or unsubscribe via the World Wide Web, visit
https://mailman.lug.org.uk/mailman/listinfo/bristol
or, via email, send a message with subject or body 'help' to
bristol-request@mailman.lug.org.uk
You can reach the person managing the list at
bristol-owner@mailman.lug.org.uk
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Bristol digest..."
Today's Topics:
1. Forrin chars in web pages (Andrew McLean)
2. Re: Forrin chars in web pages (Nigel Sollars)
3. Re: Forrin chars in web pages (Alberto Lietor Santos)
4. Re: Forrin chars in web pages (Amias Channer)
5. Re: meetings (Ignacio Jordi Atienza)
----------------------------------------------------------------------
Message: 1
Date: Thu, 14 May 2015 14:39:08 +0100
From: Andrew McLean <am57762@gmail.com>
To: LUG <bristol@mailman.lug.org.uk>
Subject: [bristol] Forrin chars in web pages
Message-ID: <5554A57C.50001@gmail.com>
Content-Type: text/plain; charset=utf-8; format=flowed
I'm having some trouble with downloading web pages (using 'wget') and
converting them to text (using 'html2text'). The web pages sometimes
contain words (typically names) with foreign characters such as accented
e, c cedilla, o and i with two dots, etc etc.
These end up as pairs of character codes in the text file.
Anyone know of an easy way to address this, e.g. with an HTTP option in
wget ? I've looked at the docs for both tools, and nothing caught my eye.
I guess that the charsets in use are generally UTF-8. Maybe there's a C
library or tool that can do a conversion to the 'nearest' standard
character ?
I'll keep looking.
Ta,
Andrew M
------------------------------
Message: 2
Date: Thu, 14 May 2015 09:57:02 -0400
From: Nigel Sollars <nsollars@gmail.com>
To: Bristol and Bath Linux User Group <bristol@mailman.lug.org.uk>
Subject: Re: [bristol] Forrin chars in web pages
Message-ID:
<CAG6aBkVj-7On5CNOhNcerq9nX9Zt6=Q6qXCAFNKBV_6z1JtNEA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Hi,
Maybe not C, but perl and python seem to do it quite nicely.
Nige
On Thu, May 14, 2015 at 9:39 AM, Andrew McLean <am57762@gmail.com> wrote:
> I'm having some trouble with downloading web pages (using 'wget') and
> converting them to text (using 'html2text'). The web pages sometimes
> contain words (typically names) with foreign characters such as accented
> e, c cedilla, o and i with two dots, etc etc.
> These end up as pairs of character codes in the text file.
>
> Anyone know of an easy way to address this, e.g. with an HTTP option in
> wget ? I've looked at the docs for both tools, and nothing caught my eye.
>
> I guess that the charsets in use are generally UTF-8. Maybe there's a C
> library or tool that can do a conversion to the 'nearest' standard
> character ?
> I'll keep looking.
>
> Ta,
> Andrew M
>
>
> _______________________________________________
> Bristol mailing list
> Bristol@mailman.lug.org.uk
> https://mailman.lug.org.uk/mailman/listinfo/bristol
>
--
?Science is a differential equation. Religion is a boundary condition.?
Alan Turing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.lug.org.uk/mailman/private/bristol/attachments/20150514/019a21f8/attachment-0001.html>
------------------------------
Message: 3
Date: Thu, 14 May 2015 15:06:42 +0100
From: Alberto Lietor Santos <alietors@gmail.com>
To: Bristol and Bath Linux User Group <bristol@mailman.lug.org.uk>
Subject: Re: [bristol] Forrin chars in web pages
Message-ID:
<CACXZLW72W_WLAeN4gUvdV28pxn1RyGheBdGgC_HQ6TSSqGVS5w@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Did you try using the parameter -utf8 on html2text?
2015-05-14 14:57 GMT+01:00 Nigel Sollars <nsollars@gmail.com>:
> Hi,
>
> Maybe not C, but perl and python seem to do it quite nicely.
>
> Nige
>
> On Thu, May 14, 2015 at 9:39 AM, Andrew McLean <am57762@gmail.com> wrote:
>
>> I'm having some trouble with downloading web pages (using 'wget') and
>> converting them to text (using 'html2text'). The web pages sometimes
>> contain words (typically names) with foreign characters such as accented
>> e, c cedilla, o and i with two dots, etc etc.
>> These end up as pairs of character codes in the text file.
>>
>> Anyone know of an easy way to address this, e.g. with an HTTP option in
>> wget ? I've looked at the docs for both tools, and nothing caught my eye.
>>
>> I guess that the charsets in use are generally UTF-8. Maybe there's a C
>> library or tool that can do a conversion to the 'nearest' standard
>> character ?
>> I'll keep looking.
>>
>> Ta,
>> Andrew M
>>
>>
>> _______________________________________________
>> Bristol mailing list
>> Bristol@mailman.lug.org.uk
>> https://mailman.lug.org.uk/mailman/listinfo/bristol
>>
>
>
>
> --
> ?Science is a differential equation. Religion is a boundary condition.?
>
> Alan Turing
>
> _______________________________________________
> Bristol mailing list
> Bristol@mailman.lug.org.uk
> https://mailman.lug.org.uk/mailman/listinfo/bristol
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.lug.org.uk/mailman/private/bristol/attachments/20150514/472e3a45/attachment-0001.html>
------------------------------
Message: 4
Date: Fri, 15 May 2015 10:17:07 +0100
From: Amias Channer <me@amias.net>
To: Bristol and Bath Linux User Group <bristol@mailman.lug.org.uk>
Subject: Re: [bristol] Forrin chars in web pages
Message-ID:
<CAMgU7XV=K3_LJKzf53UyB56o56jXrGJm5xbT_OcBoLR1rx+aoA@mail.gmail.com>
Content-Type: text/plain; charset=UTF-8
Hello Andrew,
You might not have unicode support enabled in your shell , i think
both of those tools should
be able to handle utf8 if you did.
to test this : echo $LANG
mine says en_GB.UTF-8 which means it enabled
if its isn't it would be just en_GB or (horrors) C
you can change this by running export LANG='en_GB.UTF-8' and then trying again.
Not sure which distro you are using but it should have a method to set
this up permentantly.
Unless you are a kipper or running some seriously ancient hardware
there isn't a good reason
not to have UTF8 support on by default as it won't cause problems.
Cheers
Amias
On 14 May 2015 at 15:06, Alberto Lietor Santos <alietors@gmail.com> wrote:
> Did you try using the parameter -utf8 on html2text?
>
> 2015-05-14 14:57 GMT+01:00 Nigel Sollars <nsollars@gmail.com>:
>>
>> Hi,
>>
>> Maybe not C, but perl and python seem to do it quite nicely.
>>
>> Nige
>>
>> On Thu, May 14, 2015 at 9:39 AM, Andrew McLean <am57762@gmail.com> wrote:
>>>
>>> I'm having some trouble with downloading web pages (using 'wget') and
>>> converting them to text (using 'html2text'). The web pages sometimes
>>> contain words (typically names) with foreign characters such as accented
>>> e, c cedilla, o and i with two dots, etc etc.
>>> These end up as pairs of character codes in the text file.
>>>
>>> Anyone know of an easy way to address this, e.g. with an HTTP option in
>>> wget ? I've looked at the docs for both tools, and nothing caught my eye.
>>>
>>> I guess that the charsets in use are generally UTF-8. Maybe there's a C
>>> library or tool that can do a conversion to the 'nearest' standard
>>> character ?
>>> I'll keep looking.
>>>
>>> Ta,
>>> Andrew M
>>>
>>>
>>> _______________________________________________
>>> Bristol mailing list
>>> Bristol@mailman.lug.org.uk
>>> https://mailman.lug.org.uk/mailman/listinfo/bristol
>>
>>
>>
>>
>> --
>> ?Science is a differential equation. Religion is a boundary condition.?
>>
>> Alan Turing
>>
>> _______________________________________________
>> Bristol mailing list
>> Bristol@mailman.lug.org.uk
>> https://mailman.lug.org.uk/mailman/listinfo/bristol
>
>
>
> _______________________________________________
> Bristol mailing list
> Bristol@mailman.lug.org.uk
> https://mailman.lug.org.uk/mailman/listinfo/bristol
------------------------------
Message: 5
Date: Fri, 15 May 2015 09:18:49 +0000 (UTC)
From: Ignacio Jordi Atienza <ijordiatienza@yahoo.es>
To: Bristol and Bath Linux User Group <bristol@mailman.lug.org.uk>
Subject: Re: [bristol] meetings
Message-ID:
<2070176509.209342.1431681529486.JavaMail.yahoo@mail.yahoo.com>
Content-Type: text/plain; charset="utf-8"
Just to make sure, the next LUG meeting is on Saturday 30th, right? I used to check it at http://www.bristol.lug.org.uk/, but the last meeting mentioned there seems to be February 2015...
| ? |
| ? | ? | ? | ? | ? |
| Bristol and Bath Linux User Group: WelcomeA Linux group for users in and around the Bristol and Bath area. Featuring a mailing list and monthly meetings in central Bristol. |
| |
| Ver en www.bristol.lug.org.uk | Vista previa por Yahoo |
| |
| ? |
?
De: Peter Hemmings <peter@hemmings.eclipse.co.uk>
Para: Bristol and Bath Linux User Group <bristol@mailman.lug.org.uk>
Enviado: Lunes 11 de Mayo de 2015 15:07
Asunto: Re: [bristol] meetings
Please excuse top posting as this is a general comment/invitation.
Well done for offering yourself!!
Rather than going through the minutia of points at this time,? I suggest
you try and get to the KT in 2 weeks time when I and/or others who
organized the last event can give you some background information.
The first point is to establish the need and support for a special event
and if so what type, then get a small group together and come back to
the list with proposal(s).
FYI "LinuxLive" was a single event to get "ordinary" people to change to
Linux when XP became unsupported.
On 11/05/15 14:14, Ignacio Jordi Atienza wrote:
> Hey guys, thank you for considering my idea about more structured
> meetings. A few points I'd like to add to the pot, fwiw:
>
> *Someone mentioned this idea is not new, as there have been LinuxLive
> events. I've never been to one, maybe it could be better to take it
> from there, improving what already exists, rather than starting
> something new from scratch? (If the main problem was "poor
> promotion", trying out some countermeasure to that for the next
> event, and improve from there?)
>
> *Re: estimated attendance. Would it be too sci-fi to think of some
> online micropayment system where those who are going to attend could
> pay their share in advance? That would give a good scope of how many
> people would attend, rather than the, as mentioned, always shaky FB
> estimations (because clicking on an icon is way too easy, and also,
> some of us are allergic to that kind of services).
>
> *Re: venue. I currently have more time than money, so I offer myself,
> if there is quorum, to do the inquiries and post here a "report" on
> the different venues you have listed (Watershed, etc). Hackspace has
> the advantage that it is close to a train station, right? I also
> thought of the Central Library, which has a nice no-fuss wifi
> available. I thought the lack of projectors would be a problem, nice
> to know that there are 2 available already! :)
>
> The factors to investigate are then, correct me if I'm wrong:
>
> Capacity Wifi available Projector Food/Bar in nearby Price Means of
> transportation close
>
>
> *Re: "Speakers". Here is a proposal for a system, version 0.1 very
> beta: anyone who would like to share something in the meeting could
> post a "ticket" including 1) title and 2) approximate duration of the
> speak including Q+A. Joining and tailoring those tickets we would
> have a program for the session, and also it would give us an idea of
> how long we have to book the room. The problem here can be that there
> are either too many speakers, or not enough; if the problem was the
> second, I have a couple of ideas I could submit myself :)
>
> *Underlying motivations: "Using Linux" is kind of a broad label, so
> I'd like to say what would be my motivation for an event like this,
> which of course can differ for other people from other walks of life,
> and maybe foster some healthy debate. I see an event like this as:
>
> 1) Quality Leisure. Recently I was watching a Youtube video about the
> state of the kernel, and thinking, "god, I need more of this stuff!".
> I'd rather sit and listen to someone explaining why he thinks his
> window manager rocks, or a demo of someone's distro project, or how
> they hacked some piece of hardware to do X, than... any of the sad
> devices out there that a lot of people call "entertainment".
>
> 2) Community, giving back... all the feelgood stuff. I'm sure this
> format can help a lot of new projects and ideas appear, and problems
> be solved... Also, maybe at some point someone could bring a
> camera... I'm just playing around with the idea, forgive me if I go
> too far too soon, but wouldn't it be nice to put some content out
> there? Someone not long ago mentioned the wish to promote the LUG a
> bit more; a Youtube account would be cool, imo...
>
> Let me know what you think and let's advance the idea! Thx
>
>
> -------------------------------------------- El mar, 5/5/15, Fergus
> Allan <f_w_allan@yahoo.co.uk> escribi?:
>
> Asunto: [bristol] meetings Para: bristol@mailman.lug.org.uk Fecha:
> martes, 5 de mayo, 2015 16:45
>
> Hi,? When it comes to location, a factor is mode of transport.? If
> train is the preferred option, then getting much further away than KT
> is hard.? If car is the option, then there are a lot of other
> locations. Personally, I live in north Bristol (Frampton Cotterell)
> so I?m strongly on the side of car (possibly bus but that?s #6 to
> Bristol). If we go by car, then car sharing could be an option.? I
> can get a hall very cheap. Fergus -----Adjunto en l?nea a
> continuaci?n-----
>
> _______________________________________________ Bristol mailing list
> Bristol@mailman.lug.org.uk
> https://mailman.lug.org.uk/mailman/listinfo/bristol
>
> _______________________________________________ Bristol mailing list
> Bristol@mailman.lug.org.uk
> https://mailman.lug.org.uk/mailman/listinfo/bristol
>
Regards
--
Peter H
_______________________________________________
Bristol mailing list
Bristol@mailman.lug.org.uk
https://mailman.lug.org.uk/mailman/listinfo/bristol
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.lug.org.uk/mailman/private/bristol/attachments/20150515/1af24ee1/attachment.html>
------------------------------
_______________________________________________
Bristol mailing list
Bristol@mailman.lug.org.uk
https://mailman.lug.org.uk/mailman/listinfo/bristol
End of Bristol Digest, Vol 600, Issue 2
***************************************
Tidak ada komentar:
Posting Komentar