From KCHUNG wiki

Manuals: KCHUNGArchive

everything that has ever been broadcast on KCHUNG (regular programming from the studio, special events, remote broadcasts, news programming etc) is archived and made accessible for searching and download at http://kchungradio.org/archive.html.

archived mp3 files can be browsed by date or by show, and can be searched by keyword, artist / guest names, episode titles, location / venue etc.

all of this metadata is stored in a database hosted by Los Angeles Contemporary Archive, a partner organization of KCHUNG.

editing database information for shows

any kchung community member can edit information about KCHUNG shows by clicking the "edit" link under any file in the archive, or logging into to the LACA site, and using the user name "kchung" and the password "Kchungarchive99".

The information that can be added or edited include:

once you've logged in, you can also create or edit a description for any show / series as a whole, by visiting http://lacarchive.com/node/add/series-details or clicking the "add show details" that appears at the bottom of search results returned when browsing by show in the kchung archive.

any information about guests, playlists, descriptions, keywords etc that you add help make the database more searchable / more useful... go wild.

troubleshooting archive issues

for any issues with the archive, including missing or mis-labeled shows, broken links, duplicate uploads, merging multiple search-terms / show names / artist names into one term etc, or just general help with using the archive, email archive@kchungradio.org. suggestions on how to make the archive work better are also very much appreciated!

there is also the opportunity to get involved as an archive "power user", i.e., digging a little deeper into the structure of the archive, bulk-editing, and helping things to work better. if you are interested in this, send an email to archive@kchungradio.org or see the archive troubleshooting page.

how the archive gets synchronized / the archive-sync script

once they've been converted and named by station managers, archive files are hosted locally as mp3's in the ~/desktop/archive folder, sorted in folders according to date. this folder gets backed up to external hard drive irregularly, and very rarely does anything in this folder need to be deleted. hi-res versions of files are stored in the ~/desktop/wav+aiff file archive folder - this folder needs to be dumped to external hard drive more frequently, and older files deleted.

the archive files (as mp3's) are hosted remotely in a dreamhost "dreamobjects" account - functionally equivalent to an amazon s3 cloud storage "bucket". the structure of an s3 bucket is special - each bucket has a username and two keys for authentication, each file and directory are stored as "objects". the bucket can be browsed using dreamhost's cpanel or an application like cyberduck - i've set cyberduck up on the kchung studio computer to automatically log in with the username and security keys. cyberduck displays the bucket as if it were a typical directory structure - files inside of directories, but it's worth remembering that each object is essentially horizontally related to the others. the bucket has a cname associated with it that creates a url for each file in the format: http://archive.kchungradio.org/date-folder-name/filename.mp3, this is how one would access / download each file directly.

the database information, displayed on the kchung website as an iframe, comes from LACA's drupal, which manages a mysql database that is stored on their server. kchung has two users created in LACA's drupal that can edit this database - "kchung" (basic user) and "kchung admin" (advanced user). this database is automatically populated every time the archive-sync script is run (see below). drupal uses a regex parser to determine the date and show title from the filename (this is why it needs to be in the format "show_title-month.day.year.mp3").

on the kchung studio computer, a calendar alert is set to run the archive-sync script automatically in the background every day at 4 am. "archive-sync" is an automator script - a workflow including shell scripts and python scripts. all components, in addition to the automator script, live in the ~/scripts directory. note that the archive script does not need to be run manually - the entire process runs on its own once a day

you can open the archive-sync script in automator and look at each step. first, an rsync-like python utility called "boto-rsync" does a dry run, looking for files in the ~/desktop/archive folder that are not already in the dreamobjects bucket. this produces a text document: ~/log/archive-temp-boto.txt. next, a python script parses the text file to make a new file: ~/log/archive.csv that is just a list of filenames for each mp3 (no directories, no .ds_store files) in the correct format for drupal to auto-populate the database with show title, date, and url for each file. the newly created archive.csv is then uploaded to http://kchungradio.org/log. next, LACA's auto-populate script is called using curl. at this point, LACA's drupal parses the csv to create database entries. after waiting a minute, the remote file http://kchungradio.org/log/archive.csv is deleted to make sure there's no duplicate entries. meanwhile, boto-rsync is run again, this time in active mode, synching new local files with the dreamobjects bucket. finally, the script queries the LACA database for kchung files, to fill the cache and (hopefully) speed up pageloads.

  1. advanced troubleshooting
Retrieved from http://www.kchungradio.org/wiki/index.php?n=Manuals.KCHUNGArchive
Page last modified on May 20, 2017, at 01:57 PM