Web authoring advice

Saturday, October 1st, 2005 10:39 pm
lethargic_man: (Default)
[personal profile] lethargic_man
I've been sort-of commissioned to make a website for Marom. Now the only complex websites I've made until now have been strictly functional ones for work, driven by Perl CGI backends and with no security measures, as behind work's firewall. I can stumble my way through simple JavaScript from knowledge of C++, but don't know any Flash.

So I have a few questions for anyone who's made a public-facing website beforehand:

Firstly, any recommendations for an host ISP? This is a site of interest probably only to a few hundred people, but those few hundred might be visiting it frequently.

Secondly, as regards making it updatable by non-techies, I was thinking of providing proformas and inclusion of "what's on this week" files, etc, by server-side includes. Are server-side includes something ISPs are likely to provide?

Thirdly, would I stand a chance of having Perl available as a backend? And if so, how do you make it secure? I gather taint-checking is involved; once I've taken that into account, can I leave it all to Perl? Actually, I probably don't need Perl for the basic proposition, but when I was talking about it with Assael, he waxed lyrical about the possibilities -- message boards, a dating subsite, the works -- and it would be nice to be able to extend the site after its initial launch.

Any other advice whilst I'm at it?

Date: 2006-05-27 10:43 pm (UTC)
ext_8103: (Default)
From: [identity profile] ewx.livejournal.com
Yes, but if you're going to do that, I'm not sure why you want HTML input in the first place?

Date: 2006-05-28 08:38 am (UTC)
From: [identity profile] lethargic-man.livejournal.com
I don't; I want plaintext. Only the plaintext is getting interpolated into the HTML in place of the webform for forwarding on. The end-user doesn't know that, but I can't risk them second-guessing it.

Date: 2006-05-28 10:24 am (UTC)
ext_8103: (Default)
From: [identity profile] ewx.livejournal.com

Ah, right. I'm unclear at what point you're substituting into HTML, but whatever point that is is the point you should SGML quote it, and not before.

I would:

  • Convert at least < and & into entities;
  • Strip out control characters (0-8, 11-31, 127-159);
  • Convert anything outside ASCII range into numeric character references, avoiding the need to know what encoding the output uses

Obviously you need to know what the input encoding is in order to be able to correctly interpret any bytes outside the 0-127 range. DisOrder uses <form ... enctype="multipart/form-data" accept-charset=utf-8> to request that the input be UTF-8.

Once you have encoded every character with special meaning, there is no need to delete script tags, since there's no possibility of anything being interpreted as a tag any more.

Date: 2006-05-28 11:03 am (UTC)
From: [identity profile] lethargic-man.livejournal.com
Great, thanks.

Profile

lethargic_man: (Default)
Lethargic Man (anag.)

February 2026

S M T W T F S
1234567
8 91011121314
15161718192021
22232425262728

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Saturday, February 28th, 2026 08:18 pm
Powered by Dreamwidth Studios