Share Facebook Twitter Google + LinkedIn Pinterest National Corn Growers Association First Vice President and Minden, Iowa, farmer Kevin Ross spoke in support of the Environmental Protection Agency’s (EPA) proposed rule to allow for year-round sales of E15 across the country.“Farmers stand ready to work with the Administration to clear obstacles to higher blends of ethanol such as E15 and ensure a final rule works for the full ethanol and fuel supply chain,” Ross said. “To ensure E15 sales are not interrupted, NCGA urges EPA to complete this rulemaking by June 1.”Ross’s comments came during a hearing held as part of the rulemaking that would remove outdated regulations requiring retailers in many areas of the country to stop selling E15, a blend of gasoline and 15% ethanol approved for all vehicles 2001 and newer, during the summer months.Year-round E15 is a no-costs means for farmers to grow demand. It also saves drivers between 3 and 10 cents per gallon and reduces greenhouse gas emissions.During the hearing, Ross also spoke on the negative impacts EPA’s expansive Renewable Fuel Standard (RFS) waivers for refiners have had on farmers who are already struggling amid declining farm incomes and trade disruptions. NCGA has urged the EPA to rein in RFS exemptions provided to profitable refiners.Ross also urged caution around the proposed rule’s complex RIN market reform proposal.“To ensure E15 sales are not interrupted, NCGA urges EPA to complete this rulemaking by June 1, as well as keep the complex RIN market reform proposal from weighing down the E15 rule,” Ross said.
Struggling with even the most cutting-edge big-data tools, developers at Facebook have announced a new open-source tool set, Corona, designed to enable data-processing on the incredible scale required by the massive social-media network.Admittedly, there aren’t a lot of companies with big data this big, but Corona should have spill-over benefits for the entire Hadoop-using big data community. Increasing Hadoop’s overall efficiency should make big data even more accessible and faster to use, enabling big data applications to disrupt even more industries.Conventional databases that reach into the terabytes are difficult and expensive to work with. It’s not just the size of databases that cause problems, either. Growth – of data itself, systems and queries – can waylay the most expertly managed database. It’s hard to keep all of the data tables and relationships intact across the new machines you have to keep adding to store all that data.Really, Really Big DataKeep that in mind, and then ponder Facebook’s problem: Its largest collection of data is 100 petabytes. That’s 1,048,576,000 gigabytes. Think you’re pretty bad ass with your 3TB hard drive? This is the equivalent to over 34,000 such drives. And that’s just Facebook’s largest collection of data, not all of it.And growth? According to Facebook engineers, they’re getting about half-a-petabyte coming in from the social network every 24 hours – 512 terabytes every single day.This is the kind of data load that would make all but the most expensive database software and hardware on the planet go pfft! with a tiny puff of smoke and a barely audible scream of pain from the server room.Tools like Hadoop have been incredibly useful in collecting and working with this kind of massive scale. But in a blog post today from Facebook engineers Avery Ching, Ravi Murthy, Dmytro Molkov, Ramkumar Vadali and Paul Yang, even Hadoop is straining under the load. That 100-petabyte data cluster, for instance? It’s the largest known Hadoop cluster in the world.The Limits Of HadoopHadoop isn’t really a database in the usual sense. It’s a two-part data processing system.Part one is storage and tracking data across multiple machines. Part two is the MapReduce system that takes a query about the data, figures out all the places in the cluster where the data is sitting, and then divides the query up to be processed on the same machines. Once processed, the answers to the query are combined and delivered. No one machine has to do all the heavy lifting, and adding a new machine to a data cluster adds storage and processing power.But in their blog today, the Facebook engineers highlight a limitation in Hadoop that has plagued Facebook and many large-scale Hadoop users. All of that MapReduce work described above has to be done in batches, which in turn have to be scheduled so that machines in a given cluster can do the work required without overwhelming the resources of the cluster.By default, Hadoop uses a fair scheduler, which basically means first come, first served. For Facebook, the limitations of fair scheduling are very real. In early 2011, the team noted, the job trackers that schedule queries and manage cluster resources were having trouble keeping up.Imagine a vast supermarket full of all sorts of stuff. At the entrance to the market is a collection of grocery carts, all various sizes. To enter the market, you have to wait in line to get a cart. The size of the cart you get depends on how much is on your grocery list. Just need milk and bread? You get a tiny cart. Stocking up for Thanksgiving? You get the jumbo cart.Here’s the other catch: Only a limited number of shoppers are allowed in the supermarket at a time. So, everyone has to wait while a few shoppers have the run of the store and check out at the (single) register.That’s a rough idea of how Hadoop works now. Frustrated with this system, Facebook’s team is opting to change this model with a new scheduling framework, known as Corona.The Rise Of CoronaCorona divides the job tracker’s responsibilities in two. First, a new manager manages cluster resources and keeps an eye on what’s available in that cluster. In our grocery-store analogy, Corona lets more shoppers into the store and opens more checkout lanes.At the same time, Corona creates a dedicated job tracker for each job, which means the job tracker no onger has to be tied to the cluster. With Corona, smaller jobs can be processed right on the requester’s own machine. Continuing the shopping example, it’s like putting an automat outside the store that will give you anything in the store, but a few items at a time.Facebook has been using Corona for some time, according to the blog, and is still working to improve it, as well as to apply its improved scheduling features to other types of applications. More important, the team revealed that their efforts have been open-sourced, which means other developers can take this work and apply it to their own Hadoop systems.Improved job scheduling in Hadoop is a pretty big deal, if only because any Hadoop system can be improved with this kind of functionality. It’s something the Apache Hadoop developer team has already been working on in their new version of MapReduce 2.0, with the introduction of the YARN ResourceManager. But at the scales Facebook is working in now, the engineering team could not wait for YARN.Given that Corona is also open source, there’s a good possibility that work on MapReduce 2.0 will benefit from Facebook’s efforts, improving everyone’s Hadoop experience. That’s something anyone in big data can Like.Image courtesy of Facebook. Massive Non-Desk Workforce is an Opportunity fo… 3 Areas of Your Business that Need Tech Now brian proffitt IT + Project Management: A Love Affair Tags:#Big Data Cognitive Automation is the Immediate Future of… Related Posts
The Haryana police on Monday released a list of over 40 people who were wanted in connection with the violence that shook Panchkula on August 25, after Sirsa-based Dera Sacha Sauda chief Gurmeet Ram Rahim was convicted in two counts of rape of female followers. The list of 43 accused released along with their photographs had the name of Dera chief’s adopted daughter Honeypreet on the top, and Aditya Insan, Dera’s spokesperson, as the hunt for the accused continued. B. S. Sandhu, DGP, said more names could be added to the list. “We are in touch with the police of other States to track Honeypreet and Aditya Insan,” he said.
Prime Minister Narendra Modi will embark on his Lok Sabha election campaign in West Bengal on Saturday, several weeks before the announcement of the poll schedule. He will address two rallies in the State, first at Thakurnagar in North 24 Parganas, and the second at Durgapur.The reason, as declared by the BJP, is to capture half of the State’s 42 Lok Sabha seats, the third highest among the States.Mr. Modi is expected to address close to “100 rallies” in Bengal and the Northeast in the run-up to the poll, as the region has 67 seats, with 84% located in Bengal and Assam. But it’s not just the BJP embarking on a campaign trail; the united Left Front is also bringing in its star campaigners to kick-start its campaign from the city’s mega rally venue — the Brigade Parade Ground — on Sunday.The BJP’s poll strategy, according to party insiders, is three-pronged. First, it is planning to routinely fly in big leaders to campaign in the State. Second, the party is identifying and areas to engage its workers and leaders. One such area is Thakurnagar, the headquarters of the predominantly Dalit refugees from Bangladesh over decades, where Mr. Modi will address his first public rally amid the citizenship Bill controversy. The Prime Minister will then fly to Durgapur which falls under Asansol, a seat last bagged by the BJP.Third, the BJP is planning to place its cadres booth-wise and also pressurise the Election Commission to deploy more Central paramilitary forces in the State.Social media campaign The Left Front has launched a massive social media campaign to make its Brigade rally on Sunday a success. While former CM Buddhadeb Bhattacharya has said the rally would “pave the way for a bigger resistance”, the CPI(M)’s social media managers have specially designed and posted a series of videos, songs and interviews. The event has gained momentum and the Left leaders are hopeful that it will be able to give the united Opposition rally “a run for money”.The PM, too, is expected to address a Brigade rally on February 8.