Naming Conventions

Filesystem Limitations

Internationalization Issues

An interesting headache I encountered: I ripped and encoded a bunch of music on a Windows 2000 machine. I then moved all of this music to a Linux server via Samba (fairly stock install). That worked fine, and if I looked around at the files in Linux, thinks were ok.

After creating a Samba for the music on the Linux box, Windows decided to freak out if the name of a file or folder contained a high-ASCII character like "é". Long story short: Samba uses a default character set of UTF-8, and Windows uses code page 1252. Windows can't seem to read certain UTF-8 characters in folder/file names, so it goes into some unknown state when you try this.

The fix is to add the following command to smb.conf in the [globals] section BEFORE you copy your files over: unix charset = ISO8859-1. This instructs Samba to use Latin-1 encoding to write file and directory names, which in turn keeps Windows happy when it goes to read the filesystem.

If you did what I did and copy the music over before fixing Samba's config, you'll need to manually search for and replace any high-ASCII characters that may have been converted to UTF-8, and change them back to ISO8859-1 characters. Yes this sucks, the Unix find command is useful to do this, as is File::Find in Perl (you could automate the whole thing this way), and even Windows' search program works.

Unicode

I don't know enough about Unicode in filesystems to be coherent here, so if anyone has any comments, let me know. Obviously I'd love to support it to maximize internationalization of the software, but at present I'm just not that interested in researching it since I won't actually use it.

Delimiters

I've somewhat arbitrarily chosen ## to be the delimiter in directory and file names. This just makes it easier to parse :). I figure ## is pretty rare in album, artist, or song titles, so it's probably a sensible choice.

Directory Naming

Artists

Just the name of the artist. i.e.:

music/
	Rheostatics/
	Pink Floyd/
	The Doors/
	... 

File Naming

Live Shows / Bootlegs

Lord knows there's no shortage of info out there already on how to name live shows. There's equally as much info for recording shows. I like etree's naming standards, and hope to support it down the road, but for now we'll stick to a scheme similar to what is being used for other album types.

[BUGBUG: this whole idea is only at the proposal stage!]

The naming format will be very similar to a normal album, the main difference being that album titles will always start with a MM-DD-YYYY date followed by a venue name. i.e.

Medeski Martin & Wood/
	1993##It's a Jungle In Here/
	2000##Tonic/
	1995##07-12-1995 - Bluebird Theatre, Denver, CO/
		Medeski Martin & Wood##07-12-1995 - Bluebird Theatre, Denver, CO (Disc 1)##01##Dracula.mp3
		Medeski Martin & Wood##07-12-1995 - Bluebird Theatre, Denver, CO (Disc 1)##02##Sequel.mp3
		Medeski Martin & Wood##07-12-1995 - Bluebird Theatre, Denver, CO (Disc 1)##03##Chinoiserie.mp3
		...