Saturday, April 12, 2008

Links 2008-04-12: Jaql, Hadoop, ThruDB, Python & AWS, DNS

  • PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any web page. PottyMouth is ideal for displaying blog comments, text email bodies in a web mail application or mailing list web archive, or any text fields on any site with user input text, such as a social networking, dating, or community site. In short, it suits any input that is entered as text by a non-technical and/or untrusted user and then displayed in HTML.
  • Jaql - a query language for JSON: presents an example of JSON data, describes the key features of Jaql, and shows how it can be used to process JSON data in parallel using Hadoop's map/reduce framework.
  • Pig is a dataflow programming environment for processing very large files. Pig compiles these dataflow programs into (sequences of) map-reduce jobs and executes them using Hadoop. It is also possible to execute Pig Latin programs in a "local" mode (without a Hadoop cluster), in which case all processing takes place in a single local JVM.
  • Hadoop On Demand is a system for provisioning and managing independent Hadoop MapReduce and HDFS instances on a shared cluster of nodes. HOD is a tool that makes it easy for administrators and users to quickly set up and use Hadoop.
  • Hadoop Streaming allows you to create and run map/reduce jobs with any executable or script as the mapper and/or the reducer. For example:
    $HADOOP_HOME/bin/hadoop jar $HADOOP_HOME/hadoop-streaming.jar \
        -input myInputDirs \
        -output myOutputDir \
        -mapper /bin/cat \
        -reducer /bin/wc
    
  • Red Black Trees Tutorial
  • Jing Project - The concept of Jing is the always-ready program that instantly captures and shares images and video…from your computer to anywhere.
  • Historical Graphs for Mortgage Rates
  • The JasPer Project is an open-source initiative to provide a free software-based reference implementation of the codec specified in the JPEG-2000 Part-1 standard (i.e., ISO/IEC 15444-1).
  • DNS for Rocket Scientists: an online guide to DNS and (mostly) BIND 9.x on Linux (Fedora Core) and the BSDs (FreeBSD, OpenBSD and NetBSD)
  • ThruDB: document-oriented database services
  • PyAWS
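
Since the Hadoop Streaming item above says any executable or script can serve as the mapper or reducer, here is a rough sketch of what a word-count pair might look like in Python. The names `map_lines`, `reduce_lines`, and the `map`/`reduce` command-line argument are my own inventions for illustration, not anything from the Hadoop docs. The only assumptions about streaming itself are that records are lines of text, keys and values are tab-separated, and the reducer sees its input sorted by key.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one "word<TAB>1" record for every word on each line."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts for each word.

    Assumes the records arrive sorted by key, so equal words are adjacent.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines if line.strip())
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


# Hypothetical convention: run as "script map" or "script reduce",
# reading records from stdin and writing records to stdout.
if __name__ == "__main__" and len(sys.argv) == 2 and sys.argv[1] in ("map", "reduce"):
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

To try it without a cluster, the sort between the two steps can be simulated locally: `list(reduce_lines(sorted(map_lines(["to be or not", "to be"]))))` gives one tab-separated count per distinct word.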

