“Like” Harvesting: What is it? Why does it matter?

facebook LikeWe have all seen facebook posts similar to the one with a cute little girl holding a sign that reads “My dad said that if I get a million ‘likes’ he will buy me a puppy.” While these posts appear innocent, they rarely are. Most of these posts are intended to gain enough “likes” to raise Facebook’s score for the page associated with the post. The higher score these pages have gained allows these pages to have much broader distribution when they post the subsequent advertising; sometimes these pages are offered for sale to other advertisers looking for the broad distribution of their adds (even though this is prohibited by facebook’s policies). Realize that most of the time you are not helping a little girl get a puppy,¬† encouraging a child with cancer that people care for him, or encouraging a handicap person to run a marathon; you are allowing yourself to be targeted with advertising you almost certainly did not want. While participation in a “like” harvesting scheme will result in nothing more than receiving annoying (and possibly offensive) ads, other internet SCAMS can have much more severe consequences.¬†Always realize that there just may be malicious motives behind the seemingly innocent requests you receive. In the anonymous world of the Internet, it is always best to approach requests from those you do not personally know with a great deal of suspicion.

Choosing a storage platform: P2

Choosing a storage platform: continued

  1. What application or applications will be supported on this storage platform?
    • Understanding requirements of the application or applications placed in service on the storage platform is a critical piece of information needed when choosing a storage platform. Probably the best way to illustrate this issue is to look at a storage platform like EMC’s Isilon platform. It scales extremely well when presenting NAS volumes to hundreds or even thousands of clients. Performance increases almost linearly with each added shelf because the I/O is distributed across all of the nodes (shelves) in the cluster. The size of existing volumes can be expanded or reduced without taking volumes offline and shelves can be added or removed from production clusters without service interruption; this allows for tremendous flexibility in adapting to rapidly changing storage needs. However, if you intend to run a large OTP database, Isilon is probably the wrong storage platform for the job. The Isilon platform provides reasonably good throughput to each client and can sustain that throughput to a large number of clients simultaneously, but it is not a great choice when very high performance is needed for only a couple of clients (something typical of OTP databases). Additionally, most OTP databases are going to perform best when a LUN is presented via a SAN infrastructure but Isilon performs best when presenting NAS filesystems.
    • Do multiple applications intended for use on this storage platform have significantly different requirements? Some storage platforms perform well in a SAN configuration but poorly in a NAS configuration, others perform well in a NAS configuration but poorly in a SAN configuration.Some perform well in one configuration or the other but not both simultaneously, and some storage platforms do a reasonably good job when simultaneously publishing both SAN and NAS volumes. Knowing how you intend to use your storage can help narrow the choices considerably.


  2. What are the performance requirements needed for your applications?
    • One important, but often overlooked, aspect when evaluating performance needs of a storage platform is the labor costs incurred by a slow system. When end users incur delays that inhibit their ability to complete their work quickly, labor costs increase. Understanding how storage performance impacts the interactive performance of the application utilizing that storage is critical to evaluating your performance needs. For example, if a $50,000 investment is needed to double the interactive performance of an POS application and the current response time is .1 seconds and the increased performance results in a new response time of .05 seconds, the investment in the higher performing system was probably a waste of money. However, if the current response time is 30 seconds and a faster storage system will reduce it to 15 seconds then the investment may be warranted. If this system supports a call center staffed with 100 employees taking orders and if the average order is $20 and the time savings due to the increased interactive performance allows for 4 additional calls per day / per employee. The daily ROI would be $8000 dollars per day with the initial investment paid for in only 7 days! Even if the cost for a faster storage platform in this example had been $500,000 it would have still been a very smart investment.


  3. What are the recurring costs for administering the storage?
    • Do I already have staff trained to administer this storage platform and what will it cost to keep my staff current as software is updated? Continued training across multiple storage platforms can be costly and a lack of training can be even more costly if it leads to a critical mistake.
    • What are the recurring costs for H/W and S/W maintenance and (very important) will these costs change significantly in the 2nd year, 3rd year, 4th year, etc…?

Finally, it is important to remember that many considerations we make when choosing a storage platform are interdependent. For example, a comprehensive report detailing the expected ROI for different storage platforms may open the “purse strings” when it comes to budgeting. A need for extremely High Availability and Disaster recovery may lead to compromises on performance, the applications we use may have better support on certain vendor’s storage platforms (especially related to backups and cloning), etc… It is critically important to consider all aspects of your computing environment before choosing a storage platform.

Choosing a storage platform: P1

A brief overview:

Choosing a storage platform for your computing environment is one of the most critical decisions to be made and one of the most difficult. There are many different approaches to solving storage problems and each has its benefits and its drawbacks. Understanding the specific needs for your project is a critical first step in the process of choosing a storage platform. I would like to explore some of the key considerations that should be understood before “shopping” for new storage begins.

  1. What is the budget?
    • Storage platforms can span a very wide range of costs. Simple platforms can be obtained for only a couple of thousand dollars but enterprise class systems can cost millions. Knowing your budget before calling a vendor can help you and your sales person avoid wasting time looking at systems and features you cannot afford.
    • Remember that storage platforms are usually heavily discounted so it is important that you do not base your choices on the list price of the system. At times I have seen discounts well over 60% when the vendor is hungry for the sale. For this reason alone, it is usually unwise to look at storage from only one vendor; when there is competition, there is a much better chance at getting the best deal.


  2. What are the Service Level requirements that must be met?
    • Fault tolerant storage platforms can be very expensive and so it is very important to understand what SLA’s are required before searching for a storage system. In an environment where most business can be conducted “offline” in the event of a system failure, a basic storage platform with next day parts replacement may be suitable. However, in an environment, like a large OTP database, where tens of thousands or sometimes millions of dollars can be lost each hour the system is down, the cost of highly fault tolerant storage is just part of the cost of doing business; these costs are a drop in the bucket compared to the losses that could potentially be sustained in the event of a storage system failure.
    • When remote replication is required there are a number of considerations that come into play when deciding on the appropriate technology.
      1. What are the bandwidth requirements needed to replicate the data and what are the recurring costs associated (Budget considerations again).
      2. How much data loss is acceptable in the event of a failure that requires a move to the DR systems. Many replication strategies update the remote site only periodically i.e. once an hour, once a day, etc… If a storage system were to fail or be destroyed in a fire, flood, etc… near the end of the update window, all data updated during that window would be lost.
      3. Is replication of your applications best handled by storage replication or replication at the applications level? While storage vendors will always want to sell their replication, often the best strategies for updating database applications are done at the applications level and not at the storage level because the recurring costs of the WAN tend to be less and the time window for potential data loss is much shorter.
    • When considering the requirements for data protection, it is important that one not forget about backup and restore strategies. Storage platforms today are often an integral part of any backup and recovery process. Here are some considerations regarding backup and recovery.
      1. Is there a time window for backups that is sufficient to complete required backup. For applications that must be backed up while online, storage snapshot support is often required. Additionally, supplemental software is often also required to ensure that an application’s data on disk is in a “backup ready” state when the snapshot is initiated. These features often require supplemental licenses at additional costs.
      2. Are online periodic snapshot backups required? Many sites take periodic snapshots during the day to ensure that a minimal amount of data loss is incurred if a file or database becomes corrupted. Note: online snapshots are never an appropriate substitution for regular backups because a single raid failure can cause all snapshot data to be lost simultaneously. Most snapshot data is composed of a single copy that is shared between all snapshots.