July 2009
rufus-scheduler rdoc →
rufus-scheduler is a Ruby gem for scheduling pieces of code (jobs). It understands running a job AT a certain time, IN a certain time, EVERY x time or simply via a CRON statement.
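The AT / IN / EVERY distinction can be sketched outside Ruby as well. The snippet below is only an illustration using Python's standard `sched` module, not rufus-scheduler's API, and it leaves CRON-style scheduling out. The `FakeClock` class, the `demo` function and the job labels are hypothetical names made up for this sketch; the fake clock lets it run instantly instead of sleeping.

```python
import sched


class FakeClock:
    """Hypothetical manual clock so the sketch finishes without real sleeping."""

    def __init__(self):
        self.now = 0.0

    def time(self):
        return self.now

    def sleep(self, seconds):
        # Advance virtual time instead of blocking.
        self.now += seconds


def demo():
    clock = FakeClock()
    scheduler = sched.scheduler(clock.time, clock.sleep)
    log = []

    def record(label):
        log.append((clock.time(), label))

    def every(interval, label, remaining):
        # A repeating job: run once, then re-schedule itself.
        record(label)
        if remaining > 1:
            scheduler.enter(interval, 1, every, (interval, label, remaining - 1))

    scheduler.enterabs(5.0, 1, record, ("at",))        # AT absolute time 5
    scheduler.enter(2.0, 1, record, ("in",))           # IN 2 time units from now
    scheduler.enter(3.0, 1, every, (3.0, "every", 3))  # EVERY 3 units, three runs
    scheduler.run()
    return log


if __name__ == "__main__":
    for when, label in demo():
        print(when, label)
```

Swapping `FakeClock` for the module's default time functions would make the same jobs fire in real time.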
June 2009
VRT: Fun with Shell Scripts and OS X →
Recently, more malware targeting OS X has been released. This is exciting stuff, and one such sample is RSPlug. The overall premise of RSPlug’s operation isn’t very sexy, as in the end it’s just a malicious script that an unsuspecting user is tricked into running on their computer. There is no exploit or internal propagation. For the curious, the end result, for the version being...
hdfs-fuse - Google Code →
hdfs-fuse lets you mount the Hadoop HDFS in the userspace.
Hadoop Tutorial - YDN →
Welcome to the Yahoo! Hadoop Tutorial. This tutorial includes the following materials designed to teach you how to use the Hadoop distributed data processing environment.
Typetester – Compare fonts for the screen →
The Typetester is an online application for comparing fonts for the screen. Its primary role is to make web designers’ lives easier. As new fonts are bundled into operating systems, the list of common fonts will be updated.
Agile Ajax » Using ActiveRecord to Migrate Legacy... →
Here’s your problem. You’re converting an existing project to Rails. The existing system has a database that, in the best case, doesn’t use Rails naming conventions. In the worst case, the old data is ill-structured, or uses a commercial database platform that you want to abandon.
[shell-fu:view-850]$ →
Do a sha256sum of an entire directory and check its integrity. Modifying the IFS variable is necessary for filenames containing spaces.
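The same idea can be sketched in Python, where filenames with spaces need no special handling. This is a minimal sketch, not the shell-fu one-liner itself; the function names `sha256_file`, `build_manifest` and `verify` are assumptions made up for the example.

```python
import hashlib
import os


def sha256_file(path, chunk_size=65536):
    """Return the SHA-256 hex digest of one file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to root) to its SHA-256 digest."""
    manifest = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            manifest[os.path.relpath(full, root)] = sha256_file(full)
    return manifest


def verify(root, manifest):
    """Return the sorted paths that were added, removed, or changed."""
    current = build_manifest(root)
    return sorted(
        path
        for path in set(manifest) | set(current)
        if manifest.get(path) != current.get(path)
    )
```

Calling `build_manifest` once and storing the result gives the reference checksums; a later `verify` call against the same directory returns an empty list when nothing has changed.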
Fifty Books for Our Times | Newsweek Books |... →
We know it’s insane. We know people will ask why on earth we think that an 1875 British satirical novel is the book you need to read right now—or, for that matter, why it even made the cut. The fact is, no one needs another best-of list telling you how great The Great Gatsby is. What we do need, in a world with precious little time to read (and think), is to know which books—new or old,...
RubyTu.be - Ruby Screencasts and Videos for Ruby... →
RubyTu.be is a community driven collection of Ruby related videos and screencasts.
Wireshark University - Tips →
When users complain about poor network performance, capture their traffic (from as close to their systems as possible so you get round trip time values from their perspective). Set the Time column value to show you from the end of one packet to the end of the next packet by selecting View > Time Display Format > Seconds Since Previously Displayed Packet. Now you can sort this column to see...
Hyper-Metrix.com →
The Burst Engine is an OpenSource vector animation engine for the HTML5 Canvas Element. Burst provides similar web functionality to Flash and contains a layer based animation system like After Effects. Burst uses a very light-weight JavaScript frame, meaning your animations will download unnoticeably quickly and can be controlled using very simple JavaScript methods. For example: the [-] logo above...
Glimmer: a jQuery Interactive Design Tool -... →
Glimmer: a jQuery Interactive Design Tool is a prototype from the Mix Online Labs which makes jQuery accessible through a visual tool. The objective for Glimmer is pretty simple: to enable the power of jQuery through an interactive design surface. If jQuery is the “write less, do more” JavaScript library, then Glimmer is the “write none, do more” jQuery design tool. Glimmer has three core...
GX - Full-Featured Javascript Animations Framework →
GX is a full-featured, cross-browser, super-tiny (10kb uncompressed) Javascript Animations Framework. Using GX you can create complex animations working with every w3c CSS property.
$fx - JavaScript animation library →
Compact, lightweight JavaScript library that extends DOM elements with animation methods. Facilitates altering CSS properties and other parameters along a timeline. Supports parallel effect sets and effect chains. Has an extended set of callbacks to adjust behavior.
10 Impressive JavaScript Animation Frameworks →
Complex and slick JavaScript-based animation has been made easier with the emergence of frameworks and libraries that give developers the ability to create stunning and eye-grabbing animation and transition effects and that make these complex tasks easy.
24 HTML Form Elements Customization Techniques |... →
HTML form elements do not render consistently across all browsers. While some form elements such as textbox and textarea can be styled through CSS to look the same on most browsers, others are ugly and can’t be controlled with CSS. Here are 24 solutions you can use to customize your form elements.
imemcacheclient-php - Google Code →
Improved Memcache client (currently only for PHP, but it can be written in any language). It supports the Redis protocol (currently in svn only).
http://stuff.mit.edu/afs/sipb/project/ruby-lang/lib... →
scanf for Ruby is an implementation of the C function scanf(3), modified as necessary for Ruby compatibility.
NodeBox | Home →
NodeBox is a Mac OS X application that lets you create 2D visuals (static, animated or interactive) using Python programming code and export them as a PDF or a QuickTime movie. NodeBox is free and well-documented.
Hadoop Live CD at OpenSolaris.org →
This project’s initial CD development tool aims to provide users new to Hadoop with a fully functional Hadoop cluster that is easy to start up and use. We have built a bootable CD-ROM image that provides users with a three-node virtual Hadoop cluster using OpenSolaris Zones. The CD is “live”, meaning that it does not modify the contents of the user’s computer. This makes it ideal...
PiggyBank - Pig Wiki →
This is a place for Pig users to share their functions. The functions are contributed “as-is”. If you find a bug or if you feel a function is missing, take the time to fix it or write it yourself and contribute the changes.
Cloudera's Distribution for Hadoop | Cloudera →
Cloudera’s Distribution for Hadoop is based on the most recent stable version of Apache Hadoop. It includes some useful patches back-ported from future releases, as well as improvements we have developed for our support customers.
Hadoop Training: Virtual Machine | Cloudera →
In order to make it easy for you to get started with Hadoop and complete our various training exercises, we have created a virtual machine with everything you need. The VM includes Cloudera’s Distribution for Hadoop, all of our example code, as well as eclipse and other standard tools.
Cyberwar - U.S. and Russia Differ on a Treaty for... →
The United States and Russia are locked in a fundamental dispute over how to counter the growing threat of cyberwar attacks that could wreak havoc on computer systems and the Internet.
Pig Latin Reference Manual →
To get started with Pig Latin, read the overview About Pig Latin. The reference table below lists the Pig Latin components.
Make something with Freebase - Freebase →
Get started by making a base: a collection of data about something you love. Bring your own dataset, or use some of Freebase’s over four million topics to make lists, galleries, and other views about your subject.
(theinfo) →
This is a site for large data sets and the people who love them: the scrapers and crawlers who collect them, the academics and geeks who process them, the designers and artists who visualize them. It’s a place where they can exchange tips and tricks, develop and share tools together, and begin to integrate their particular projects.
infochimps's wukong at master - GitHub →
Ruby libraries for efficient, effective Hadoop streaming
infochimps.org — Find Any Dataset in the World →
Once you find an interesting dataset, see what connects to it: by topic, source, format, whatever.
Eyealike →
At the forefront of visual-based search, Eyealike offers the first enterprise-class search platform for facial similarity (Eyealike Faces), image detection (Eyealike VisualAd and Eyealike Retail), and video copyright surveillance (Eyealike Copyright). Based on unique patent-pending technology, the Eyealike Visual Search Platform offers an entirely new approach that will dramatically advance the...
Hadoop Wiki →
Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In...
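The Map/Reduce paradigm described here can be illustrated with a single-process word count. This is a rough sketch in plain Python, not Hadoop's API; the function names `map_fragment`, `shuffle`, `reduce_counts` and `word_count` are invented for the example, and a real cluster would run the map and reduce steps on different nodes.

```python
from collections import defaultdict


def map_fragment(text):
    """Map step: emit a (word, 1) pair for every word in one fragment."""
    return [(word.lower(), 1) for word in text.split()]


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(fragments):
    """Run map over every fragment, then shuffle and reduce the results."""
    pairs = []
    for fragment in fragments:
        # Each fragment is independent, so it could be mapped on any node
        # and simply mapped again if that node failed.
        pairs.extend(map_fragment(fragment))
    return dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["a b a", "b c"]))
```

Because each call to `map_fragment` depends only on its own input, re-running a fragment gives the same pairs, which is what makes re-execution on another node safe.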
Kickass Labs » Blog Archive » Hadoop Streaming for... →
Note: This article assumes that you know a little about MapReduce, or that if you don’t, you might skim the enclosed links so you know what I’m talking about when I get to the examples, or check out the Hadoop Tutorial. It also assumes that you have Hadoop set up - either clustered or pseudo-clustered - if you’re going to run the examples. Or you can just read along.
Git at Apache →
This is a collection of read-only Git mirrors of Apache codebases. The mirrors are automatically updated and contain full version histories (including branches and tags) from the respective source trees in the official Subversion repository at Apache.
SourceForge.net: kosmosfs » home →
Kosmos filesystem is a high-performance distributed filesystem for web-scale applications such as storing log data, Map/Reduce data, etc. It builds upon ideas from Google’s well-known Google Filesystem project.
Hadoop →
Apache Hadoop is a free Java software framework that supports data-intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google’s MapReduce and Google File System (GFS) papers.
FileHero.com - File tasks at your service! No... →
Welcome to FileHero.com: Your destination for downloading, converting, and viewing videos from popular media sites! We are still in the beta phase of this project, so please stop by often to check out any new and improved functionalities! For any additional information or questions, please refer to our FAQ or leave any suggestions or questions.
Linux Books - Free E-Books →
E-Books for free online viewing and/or download
Text utilities →
Text utilities
Face detection in pure PHP (without OpenCV) -... →
Lately, I’ve been looking for ways to detect faces in photos with PHP. Nowadays, face detection is built in many consumer products (camera obviously, but also Google and iPhoto), and seems to be a pretty common job. So I expected to find many solutions for doing it with PHP. Surprisingly, the only one I could find is OpenCV, an opensource lib that was originally developed by Intel. OpenCV...
Mac OS X 10.5 Leopard font management →
This page provides a comprehensive overview of the way fonts are handled by Mac OS X 10.5.
The Billion Dollar HTML Tag « Data Center... →
Can a single HTML tag really make a difference on a corporation’s financial results? It can at Google, according to Marissa Mayer, who says web page loading speed translates directly to the bottom line.
git-svn(1) →
git-svn is a simple conduit for changesets between Subversion and git. It provides a bidirectional flow of changes between a Subversion and a git repository.
Apple iPhone 3G S: The sum ($) of its parts |... →
The iPhone, of course, is more than the sum of its parts, but the cost of individual components adds up—to $178.96, to be exact.
Let's make the web faster - Google Code →
What would be possible if browsing the web was as fast as turning the pages of a magazine? We invite you to join us in exploring and innovating across the entire spectrum of performance - from Internet protocols to the browser to website development. Together, let’s make the web faster!
Keep a Backup of Installed Packages →
You might prefer to have a clean system on reinstall but sometimes it is nice to reinstall applications from a previous machine/setup. Keeping a backup list of packages will make this a snap. Just give your package manager a list of all the packages you want it to install and let it rip.
Sugar on a Stick - Sugar Labs →
Sugar Labs offers ubiquitous access to Sugar in a USB (Universal Serial Bus) flash memory drive (stick). The Sugar on a Stick project gives children access to their Sugar on any computer in their environment with just a USB memory stick. Taking advantage of the Fedora LiveUSB, it’s possible to store everything you need to run Sugar on a single USB memory stick (minimum size 1GB). This small...
Jake →
Jake is 100% free, open source and available for Linux, Windows and Mac OS X. We’re using free technologies like Jabber and open source chat server engines.
Report: Global Proxy Effort for Iran is Faltering →
Network analysts Renesys reported this morning that the global effort to supply proxy internet servers for Iranians to route around government control and communicate with the outside world is slowing down and facing increasingly effective state repression. The company mapped two thousand proxy servers shared on Twitter and other web sites over the course of the last week and found that it truly...
infodan: Managing static versioned libraries in OS... →
Mac OS X offers developers great tools for managing libraries with many versions: Frameworks. Anyway, sometimes you may want to use static libraries instead. Maybe just as a means of (psychologically) obfuscating your precious code you don’t want to put in a framework for the world to know…