Cloud Computing – Soup To Nuts

Did this presentation out at the IADT today.  Apart from one person say “I don’t understand anything you said” I think it went ok.

Still some work to do but its getting there. You can see the theme developing on my back catalog on slideshare.net.

Stop buying Rosemary – Its grows here!




w595 Jan 09 006

Originally uploaded by Joe Drumgoole

I have a bushel of this stuff growing like a weed (at least 100 packs worth) in my back garden in Dublin.

Rosemary is hardy as nails, lives comfortably through Irish winters. Why we have to import it from Spain is beyond me.

Who says there are no women in IT?

At the recent GeekNRolla event in London there was a panel on how to get more women into IT. One of the (well made) points was women in IT typical don’t describe themselves in IT. Despite working exclusively with or for IT companies they are in marketing or sales or PR or design etc.

So I put this to the test with the four women from Merrion BD who ran an excellent marketing event for tech companies here in the Digital Hub today.

There answers, marketing, sales, PR, business development etc. Not one considered they were working in IT. At most they would grant they were working with IT companies.

If we want to encourage women to work in IT, we need the existing women in IT to start admitting it 😉

GeekNRolla – One Line Takeways from Each Speaker

Inma Martinez Stradbroke Advisors : Umm, move your startup to the US?

Andy McLoughlin, CEO Huddle: The people you start with may not be the people who you finish with.

Joe Drumgoole, CTO and Founder PutPlace.com: Watch the per transaction costs in cloud computing.

Joe Stepniewski, co-founder Skimlinks: With advertising plays, go direct to sponsors rather than through CPM/CPC vendors

Jof Arnold CEOGymfu.com: Lots of people have got rich on AppStore but you probably aren’t going to be one of them

Leisa Reichelt, User experience Disambiguity: Create a persona for your users, they are not the general public

William Reeve  Angel Investor Lovefilm.com: We created 4.2m in free cash by proper management of payment schedule for creditors and debtors

Lesley Eccles Co-founder Hubdub: If you are launching in the US, launch in the US and ignore the UK/European market

Ian Hogarth, Co-founder, CEO Songkick.com: here is a wiki full of tools for startups http://startuptools.pbwiki.com/

Nick Halstead CEO Favorit: Avoid agencies and helpers and make direct contact with potential angel investors

Reshma Sohoni Seedcamp: Advertising as a business model is dead

Fred Destin Atlas Venture: VC’s are people too

Moore Hall – A Little piece of heaven in Mayo

It definitely helped to have the sunniest day of the year to date, but Moore Hall is stunning enough to transcend the worst Irish weather.

It is surrounded by amazing old forest land (now managed by Coillte, the Irish Forestry agency)

Lough Carra is adjacent and you can rent boats or go for a paddle (just like we did today).

 

http://maps.google.com/maps/ms?ie=UTF8&hl=en&t=h&msa=0&ll=53.710882,-9.224997&spn=0.011024,0.034761&msid=103490092289195700476.000467dc20fbeb11b93b8&iwloc=000467dc25607c22756b7&output=embed
View Ireland Tourism in a larger map

Amazon Web Services adds Map Reduce

Dear AWS Customer,

We are excited today to introduce the public beta of Amazon Elastic MapReduce, a web service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data. It utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service (Amazon S3).

Using Amazon Elastic MapReduce, you can instantly provision as much or as little capacity as you like to perform data-intensive tasks for applications such as web indexing, data mining, log file analysis, machine learning, financial analysis, scientific simulation, and bioinformatics research. Amazon Elastic MapReduce lets you focus on crunching or analyzing your data without having to worry about time-consuming set-up, management or tuning of Hadoop clusters or the compute capacity upon which they sit.

Working with the service is easy: Develop your processing application using our samples or by building your own, upload your data to Amazon S3, use the AWS Management Console or APIs to specify the number and type of instances you want, and click "Create Job Flow." We do the rest, running Hadoop over the number of specified instances, providing progress monitoring, and delivering the output to Amazon S3.

We hope this new service will prove a powerful tool for your data processing needs. You can sign up and start using the service today at aws.amazon.com/elasticmapreduce.

Sincerely,

The Amazon Web Services Team

Egg Free Sponge Cake That Works

I found this recipe for egg free sponge cake from the vahrehvah.com cooking site. It uses condensed milk and icing sugar (they call it powdered sugar) along with coca-cola to give the necessary lift and binding that is missing when you remove eggs from a sponge recipe.

It works really well. The cooking times are pretty aggressive and I bottled out and left it in for 25 minutes instead of 15, which was a mistake. Go with the timings they give you.

The full text of the recipe is available on the vahrehvah website.

I made this chocolate icing which was extra yummy.

Amazon Launches AWS Toolkit for Eclipse

Dear Amazon EC2 Customer,

We are excited today to introduce the AWS Toolkit for Eclipse, a plug-in for the Eclipse Java IDE that makes it easier to develop, deploy, and debug Java applications on Amazon Web Services. With the AWS Toolkit for Eclipse, you’ll be able to get started faster and be more productive when building AWS applications.

The initial launch of the AWS Toolkit for Eclipse is targeted at Amazon EC2 developers and provides basic management features along with tools for deploying and debugging Java web applications.

The AWS Toolkit for Eclipse, based on the Eclipse Web Tools Platform, guides Java developers through common workflows and automates tool configuration, such as setting up remote debugger connections and managing Tomcat containers. The steps to configure Tomcat servers, run applications on Amazon EC2, and debug the software remotely are now done seamlessly through the Eclipse IDE.

The AWS Toolkit for Eclipse is available at no additional cost for Amazon EC2 developers. We encourage you to check it out and contribute code to the project.

To learn more about and download the AWS Toolkit for Eclipse, go to http://aws.amazon.com/eclipse.

Sincerely,

The Amazon Web Services Team

Top Tips For Configuring your NetBook

So you’ve bought a new NetBook (as millions of people have, its the most popular new kind of computer to buy), but it has two problems, the screen is too small and the processor is a bit sluggish. So a few simple steps can address some of these issues fairly quickly. All that follows presumes you are running Windows XP.

  • Change the Performance Settings on Control Panel -> System -> Advanced -> Settings to “Adjust for Best Performance” (stops visual cruft slowing you down)
  • Change the taskbar to autohide by selecting “Auto-hide the taskbar” on Control Panel -> Task Bar and Start Menu -> Taskbar (gives you more display real estate)
  • Now reset the fonts and border sizes on Control Panel -> Display Properties -> Appearance Advanced.  Set the Active Title Bar size to 16,  Active Window Border to 0, Captions Buttons size to 14,  Icon size to 24, Inactive Title Bar size to 15, Inactive Window Border Size to 0,  Menu Size to 15, Palette Title size to 15, Scrollbar size to 10 and  Selected Items to 15. This will scrunch all your Window’s decoration and widgets while still leaving them legible.
  • Turn off all those manky little apps on the right hand side of the tasbar at the bottom of the screen
  • Keep you disk drive at least 20% empty (Windows needs this amount of head room)
  • Make sure to empty your recycle bin regularily

MIX09 – Windows Azure Storage

Blobs

  • Two kinds of storage in Azure. SQL Data Services and Windows Azure storage. This is about the base storage offering
  • Three kinds of storage, blogs, tables and queues
  • All accessible via a REST API
  • Access secured via 256 bit (SHA256) key
  • Two separate data centres in US (Northwest, Southwest)
  • Affinity for storage and computation to reduce latency (available April)
  • Blobs – named objects, accounts, containers and blobs. (containers are like S3 buckets)
  • Tables – structured storage (like SimpleDB)
  • Queues
  • Sharing policies are set on a container basis
  • 8kb of name/value pairs can be associated  with each container
  • Listing abstractions for blobs in a container
  • Blob name space http://<Account Name>.blob.core.windows.net/<Container>/<BlobName>
  • Blobs can be up to 50GB in size
  • PutBlob, GetBlog, DeleteBlob
  • 8kb of metadata per blob
  • Support for MD5 checksum native to Storage API
  • Can use range get to retrieve a part of the  blog
  • Support for block level upload to allow interruptible uploads (S3 doesn’t do this)
  • PutBlock 1-N, then commit with PutBlockList
  • Blocks can be uploaded out of order or in parallel
  • Blocks can be uploaded twice, newer overwrites older
  • PutblockList will delete unused blocks
  • Blocks can be up to 4MB
  • Blocks can vary in size
  • Each block has a 64 byte ID, scoped by blob name
  • Overlapping get and put? Get will always see a single version of the blob. So while put is in process old blob is all that is seen
  • First PutBlockList wins in the case where multiple PutBlockLists occur.
  • Conditional Put/Get operations to support optimistic concurrency
  • Use a hash of Block to represent block ID

Tables

  • Billions of entities, TB of data
  • Highly available, durable
  • Account, table, entity are the key concepts
  • Table names are scoped by storage account name
  • A table is a set of entities (rows)
  • A entity is a set of propeties (columns)
  • Every table has a partition key column
  • Table partition, all entities in a table with the same partition key
  • Application controls granularity of partition key
  • A heavily partitioned table makes it easier to load balance
  • Entities in the same partition will be stored together
  • Multiple operations over multiple entities can be handled atomically in the future
  • Partition key and row key gives primary index
  • If partition key is part of query its fast, if it isn’t then the query ends up scanning
  • Each entity can have up to 255 properties, mandatory properties are partition key and row key
  • All entities have a system maintained version
  • No fixed schema, just name/value pairs
  • Access via ADO.NET Data Services (supports REST API)
  • Default number of connections is 2
  • 100-continue is default. Turn this off to save round trips.
  • Turn tracking off for read only queries
  • Bug in ADO.net relating to de-serialisation fix is to name the entity class the same as the table name
  • Be prepared for partial results from your queries
  • Query is limited to 60 seconds. After this results are returned and you must continue to get the rest
  • Not a relational database, no joins, foreign keys

Queues

  • Web Roles, Worker roles
  • Reliable message delivery
  • Access via REST
  • Account, Queue, Message
  • No limit on messages in queue
  • A message is stored for at most a week
  • Messages <= 8kb
  • http://<Account&gt;.queue.core.windows.net/<QueueName>
  • Create/Delete/Clear Queues
  • Enqueue/Dequeue/Delete
  • Dequeue makes message invisible. You delete after processing. If delete doesn’t get called timeout will make message visible once invisible time expires.
  • Queues are designed to be idempotent. Each message can be processed at least once, may be processed twice.
  • No fixed ordering for dequeue of messages, but approximates to FIFO
  • Use queue length to scale your worker tasks