Friday, January 31, 2014

Aweekstweets January 24-31 2014: the week Pete Seeger went to flowers

US patent #8,375,213: Systems and methods for enabling trust in a federated collaboration (http://ow.ly/taBRE ). Issued 2-12-2013

Re-reading one's own work after an intense writing session is like awakening from dream. Was that train running down the track in my head?

Finished drafting all o next week's quick-hits this morning. Re-reading. Connected a lot of conceptual dots across my readings & my writings

"Deep Web Intelligence Platform: 6+ Capabilities Necessary 4 Finding Signals in Noise" (http://ow.ly/ta9Ai ) JK--Automated sensemaking

Today's "only Seattle artists" day on #KEXP reminds what a stunning diversity of hi-quality music continues to flow from the Emerald City.

RT @kexpplaylist #kexp Lion's Mouth by Arthur & Yu JK--"Oh my fingers in your buttons are like kissing cousins making fabrics come undone."

"#BigData’s Dangerous New Era of Discrimination" (http://ow.ly/ta6tQ ) JK--Michael Schrage on the dark side of demographic segmentation

"The Data Storm: Retail & The #BigData Revolution" (http://ow.ly/ta5HW ) JK--Interactive report from The Economist's Intelligence Unit.

"Fundamentals o Good Data Narration" (http://ow.ly/ta5cv ) JK--Eric Swayne: "Critical not to confuse the visualization WITH the insight."

"Why data science matters to Foursquare" (http://ow.ly/ta4UQ ) JK--Location-contextual notifications boost user time on check-in app.

"#BigData in Education: Big Potential or Big Mistake? (http://ow.ly/ta2Qj ) JK--Potential: personalization, feedback, tracking, assessmt

"#Bigdata ... fast data w/BlueDBM" (http://ow.ly/ta0VE ) JK--MIT prototype multi-node DB, networked flash devices, "near real-time query"

"Was Eliot Ness a Hero or Hollywood-Inspired Myth?" (http://ow.ly/t9jm2 ) JK--Chicagoans believe former. Mrs. O'Leary's Cow got a bad rap


Sexy statistics? The vintage kick of old data poured into fresh analytic bottles (http://ow.ly/t9aFE ) Friday #IBM quick-hit

Stupid WSJ headline says retailer mailing to inadvertently offensive address is fault of "big data," when in fact scale of DB is irrelevant

Wife said she mistakenly mispronouncd happy new year in Hakka & came out dirty. Have 2 trust her there. She & another Chinese had good laugh

It's tomorrow now in Asia, so I'll wish Tio Ciu family & friends (phonetically) "sin cha ju i." Happy New Year. Horse.

"The Data Visualisation Catalogue" (http://ow.ly/t7HMn ) JK--Cool compendium of alternative ways of eyeballing & exploring data patterns

I'm a bit jaded about "year of" IT pronouncements. Seems2me, the blogger usually means "year of [whatever I personally happen 2 B doing]."

"Use AR 2 cleanse, analyse, & represent #BigData? [p1]" (http://ow.ly/t6QDi ) JK--3-D visual vetting in real-world geospatial context

"Hal Varian & “New” Predictive Techniques" (http://ow.ly/t6Pys ) JK--"Techs that R more well-known in ML than econometrics or std stats"

"The Two Characteristics of Data Accuracy" (http://ow.ly/t6OSu ) JK--1) Right. 2) Unambiguous. Also. well-formed & consistent format.

@mjcavaretta Looks good. Thanks for the great automotive #bigdatamgmt tweetchat yesterday, Mike!

"Will #Hadoop replace traditional #DW?" (http://ow.ly/t6MAx ) JK--Ford's @mjcavaretta quotes something I blogged 3 years ago. Still valid

New #IBM jk #Infoworld column: "Cognitive computing can take the semantic Web to the next level" (http://ow.ly/t6MeL )

Recommendation engines? Untapped potential of video, image, & gesture analytics in retail showrooms (http://ow.ly/t6Gp5 ) Thurs #IBM q-h

New #IBM jk blog: "Autonomic planet: Distributed intelligence for a self-healing ecosystem" (http://ow.ly/t6FRu )

Preaching to the choir is no sin. It's only a sin if you fail to preach to them about the gospel of harmonizing in the right key.

#bigdatamgmt @SJAbbott @marksalke @ Out of deference to the tweeter from Ford, I opted not to mention their rival: Porter. 1928 model year.

Did intense 2-hour analyst call taking open-ended ad-hoc questions on complex #IBM response. So gr8ly appreciate having my colleagues on it!

"5 ways Washington DC is very different than Silicon Valley" (http://ow.ly/t53ul ) JK--Also, we tend to wear suits.

A8: #bigdatamgmt My car GPS talks to me. It provides endless merriment when mispronounces common words. Moves me deep down in my funnybone

A8: #bigdatamgmt "Talking cars"? Is this like a vehicular Mr. Ed? You mean connected cars?

A7: #bigdatamgmt Most of the #auto #telematics car chatter shouldn't be stored long-term. Most is disposable. Needs to be purged often.

A7: #bigdatamgmt I'd prefer that my in-car #telematics chatter doesn't store the choice expletives I occasionally direct at nearby drivers

A7: data car gener can be divided 2 different classes based on relevance for manuf, dealers, societal, privat, insur... #BigDataMgmt
 Retweeted by jameskobielus

A7: #bigdatamgmt Owners should store/control #telematics chatter assoc with in-car entertainment. Private/monetizable personal asset

A7: #bigdatamgmt Depending on laws, #auto manufacturers should have access to warranty-related #telematics data while in effect.

A7: #bigdatamgmt Will store #auto #telematics "car chatter" in owner's cloud. Ideally, tags will identify data assoc w/each licensed driver

A6: #bigdatamgmt Absolutely DO NOT focus on alerting drivers through smartphone or other gadget apps. Far too distracting. Dangerous

A6: #bigdatamgmt Make full use of the in-car navigation system to GUIDE drivers in real-time (e.g, detour here). Don't JUST alert them.

A6: #bigdatamgmt Use heads-up displays, audio, maps, graphics, large fonts, & even tactile/kinesthetic approaches to send alerts

A6: #bigdatamgmt Regard driver as "functionally disabled"--ie=., eyes, hands, & attention on safe driving. Dont distract w/excessive alerts

A5: #bigdatamgmt Component performance data should B checkpointd in factory & upon delivery. What's design/engineering/manuf's fault?

A5: #bigdatamgmt Key to correl8n is timestamped + geostamped #telematics event data. Fine-grained pinpointing o failure-contributory events

RT @Natasha_D_G A5: Geospatial, time series and machine data analysis is required. #bigdatamgmt #telematics

A5: #bigdatamgmt If you have road/envir/operation sensors in cars, you have complete driving history available to correl8 w/maintenance

A4: #bigdatamgmt #Telematics data can reveal if drivers fail to add oil & coolant, hence stressing engines beyond manufacturer recomm.

A4: #bigdatamgmt #Telematics data can reveal if drivers fail to avoid potholes, hence throw out their alignment & damage undercarriage

A3: #bigdatamgmt Info from vehicles & OLTP systems should be combined with sentiment, #CXO experience, & survey data from owner/drivers

A3: #bigdatamgmt Building & managing 360-degree car "view" requires #InternetofThings + #cloud + #BigData.

A3: #bigdatamgmt Combining event data-in-motion (from in-car #telematics) with at-rest data data requires streaming/Hadoop/NoSQL integr8n

A3: #bigdatamgmt All that car-focused data will be combined, per privacy constraints, by #auto owners, insurers, & dealers.

A2: #bigdatamgmt I expect that autonomous vehicles will be deployed everywhere as continuous road-quality sensor/reporters.

A2: Think in terms of collapsing time & accuracy issues between event and those who did the design / engineering in the event #bigdatamgmt
 Retweeted by jameskobielus

A2: #bigdatamgmt If you have 1000s of cars/drivers feeding road-quality data continuously, provides richer data for diagnosing faster

@mjcavaretta A2: #bigdatamgmt For consumer-owned car, sensors already on vehicles. For traffic-mgt/civil-engineer cars, specialized sensors

A2: Vehicle sensor data w/ warranty claims, production data & other data can ID quality patterns faster than b4 #bigdatamgmt
 Retweeted by jameskobielus

A2: #bigdatamgmt Cars are the advance scouts of any issues in and around the road system. They & their drivers see/feel problems first.

A1: #bigdatamgmt Road-embedded sensors can alert engineers to emerging issues (e.g., bridge stress fractures) before catastrophe hits.

A1: #bigdatamgmt In-car #telematics can provide real-time intel to feed predictions of hazard systemwide impacts (e.g., congestion)

A1: networked traffic signals could get info from vehicles & reroute traffic or change signals for better flow #bigdatamgmt

A1: Lightweight MQTT transport protocol enables real time communications #bigdatamgmt #auto #telematics
 Retweeted by jameskobielus

A1: Combo streaming geospacial data + vehicle sensor data extends driver's awareness beyond range of vision #bigdatamgmt

A1: #bigdatamgmt #Telematics can use in-car imaging, motion sensors, environmental sensors to detect and assess road hazard in real-time

A1: #bigdatamgmt Connected cars should be networked in real-time to regional alerting service. First car on scene alerts the system

@mjcavaretta I stand corrected!

#bigdatamgmt I am deeply honored to be doing this tweetchat. And I hope you all feel honored to have me in your virtual presence.

Special treat on today's #bigdatamgmt tweetchat: 2 Livonia geeks. Myself + Ford's @mjcavaretta. I'll forgive him for being Bentley HS alum.

@Natasha_D_G knows I have her back on the #bigdatamgmt tweetchats. Have topic, will tweet.

40 min to #bigdatamgmt chat: Automated #Telematics: When Cars Talk w/ @mjcavaretta http://ibm.co/1e32vuz  Join me/us! #auto #bigdata #hadoop

"Social Sentiment’s Missing Measures" (http://ow.ly/t4oZl ) JK--Classif by "biz-aligned splits." Sentiment density. Variation. Volatilty

Internet of Things? Instrument the birdies, bees, and other beasties p1: http://ow.ly/t4mPT  p2: http://ow.ly/t4mSm  Wed #IBM q-h

Join me et al on #bigdatamgmt chat: Automated #Telematics: When Cars Talk http://ibm.co/1e32vuz  at 12noon ET today #bigdata #auto

RT @kexpplaylist #kexp Desire Lines by Deerhunter from Halcyon Digest JK--2010. I pound a tweet every time I hear this pounding beat.

Catch me downtown tonight, "Past, Present, Future of #DataScience Education," GWU, WashDC, Funger Hall, Rm 103. One of many Meetup attendees

WashPost article on a "culture of toxic military leaders." I'm sure that career encourages nasty power-besotted SOBs to indulge their worst.

Day greeted me with an inch. I greeted it with a shovel.

Saw "Blue Jasmine" on DVD. Wow! My fave Woody Allen & fave Cate Blanchett film of all. Superb script, direction, & performance.

Fox News respectfully notes Seeger passing. Paraphrasing: "he sang catchy tunes." Roughly what you'd say if Billy Joel'd bit the big one.

"Large scale data analysis made easier with SparkR" (http://ow.ly/t2Oii ) JK--Blog from UC-Berkeley AmpLab

"#BigData Econ" (http://ow.ly/t2Nsb ) JK--Knwldg w/out expertise, scale w/out mass, data = capital, privacy is brand value, semantic econ

"Eclipse Ups Ante for Internet of Things Community" (http://ow.ly/t2N2s ) JK--Now has 13 projects focused on open-source M2M & IoT apps

@GrandpaRobot You seem to have ideas on the topic. Why not write that blog yourself? I'd love to read it.

"#IBM Anncs Glob Cnsltg Pract" (http://ow.ly/t2nsb ) JK--IBM Interactive Exper. Life Event Detection. Behav Pricing. Psycholing Analytics

RT @kexpplaylist #kexp Talking Union by Pete Seeger frm If I Had A Hammer: Sngs o Hope+Struggl JK--1947. Written 1941 by Seeger/Lampell/Hays

Weekly process of forcing my head to revisit data mgt themes I may not have blogged on in a while. Data gov, for example. What's new to say?

"New #IBM Kenexa Talent Suite Taps #BigData" (http://ow.ly/t1RyH ) JK--HR analyz empl work exper, social enggmt, skill dev, indiv intrsts

"Streets o Ann Arbor Will Soon Be Filled w/ Driverless Cars" (http://ow.ly/t1Okr ) JK--Designated drivers for college-town party animals?

RT @ClaireVinent: Predictions for the Internet of Things in 2014 & beyond http://buff.ly/1ljpG7K  via @jameskobielus

"Ask yr data scientists 2 call my data scientists" (http://ow.ly/t1NNu ) JK--Re analytic products, they make/influence purchase decisions

Data monetization? Pay the persons for their personal data p1: http://ow.ly/t1NjG  p2: http://ow.ly/t1NmP  Tuesday #IBM quick-hit

RIP Pete Seeger. Played huge role in putting Americans in touch with their musical heritage. Profile in courage 4 resistance to McCarthyism

RT @dangillmor: Tom Perkins bravely wages his campaign to be seen as California's most clueless ultra-rich asshat.

RT @kexpplaylist #kexp Steppin' Out by Paul Revere & th Raiders frm Nuggets Vol 2 JK--Gr8 rockers undermined credibility with hokey costumes

"When to use Pig Latin vs Hive SQL?" (http://ow.ly/t0b06 ) JK--Good guidance for #bigdata analytics developers using #Hadoop #MapReduce

"Gluster Vs. Ceph: Open Source Storage Goes Head-To-Head" (http://ow.ly/t09kG ) JK--Useful checkpoint on these communities.

#Criteria for high-quality, efficient #statisticalmodels begin with truth and beauty http://ibmdatamag.com/?p=12365  by James Kobielus

A8 U don't need #data 2 prove every pain point but most of the time, ideas w/o #data are just conjecture. #CXO

A8: it's simply being ignorant. Not doing so allows your competitors a huge opportunity to poach your customers #cxo

A8: #cxo If you grow tone-deaf from not engaging in multichannel listening, you'll slowly wither. May not die. But may not grow.

A8: #cxo Where cust exper is concerned, U wont perish if dont have #bigdata. But you'll grow tone-deaf. Multichannel listening is key.

A8: #cxo Neither true nor false. Data scale not the issue. Competitive survival is all about foresight, agility, mgmt, & resources.

A7: Supply chain data on second- and third-tier suppliers can provide competitive advantage. #cxo

A7: #cxo Your data & your data scientists may B unexceptional. Your advantage may driving data-centric decision suppt throughout your biz

A7: #cxo Your data alone may not be your competitive advantage. Rather, your key asset may be data-scientist smarts for analyzing it all

A7: #cxo Compet adv is in unique value-prop difficult/costly for others to match. Your operational data may be unique asset. Mine it!

A6: #cxo In terms o churn risk, use data (survey, call ctr, etc.) to identify which high-CLV customers most likely to jump. Target them

A6: #cxo Customer survey and social influence data can reveal who drives sentiment, hence retention/sales/etc.

A6: #cxo Customer data can readily reveal who has highest CLV historically. Predictive analytics can help forecast CLV.

A6: #cxo Focus on a) customers w/high lifetime value, b) customers with greatest influence over other customers' behavior/opinions.

A5: #cxo Using data to pre-empt churn: a) analyze competitive price-points, b) assess own vs rival solutions, c) ID why U piss 'em off

A5: #cxo Pre-empting churn: a) reduce price, b) boost value, c) provide superior customer svc/engagement.

A5: #cxo Main churn drivers include a) you're too expensive, b) don't offer competitive value, & c) have alienated customer

A4: #cxo Use data you acquire during sales process to adjust your approach to woo each individual customer. 1:1 personalization.

A4: #cxo Direct marketing = targeting your message to the most promising new segments. That demands market research data

A4: #cxo Sales prospecting depends on data on qualified leads. That's fundamental to customer acquisition. Hot prospects?

A3: #cxo Perhaps leverage customer churn data. What "matters most" may be what the competition's selling.

A3: #cxo You should start by asking what overall experience "matters most." Data to answer that? Are prod/svc features priority?

A3: #cxo Of course, "matter most" starts w/ what they're buying. Purchasing data. Also, data on what prods/feats they're requesting

A2: #cxo Do real-world experiments & A/B testing of diverse variables across different channels. Measure differential #CXO lift.

RT @TabithaDunn #CXO A2: Ethnographic research into customer moments of truth can help prioritize what data matters most. #cxo

A2: #cxo Incorporate data on social influencers. Perhaps brand value perception shaped more by them than by your channels.

A2: #cxo Gather & compare data on customer interactions on various channels. Which channels drive sales? Which drive satisfaction?

A1: #cxo Data may be lacking to reveal cust "reachability" pref if you've never given them altenative. They grin&bear your current chan

A1: #cxo Data on past interactions can reveal if responding as U wish. But if have new custs or channs, past data not much help

A1: #cxo Data can help you determine which were the most EFFECTIVE ways to reach customers, in terms of response, acceptance, satisfaction.

@IBMbigdata #cxo A pleasure once again to be driving your tweetchat experience for the next hour or so.


Drafted latest #IBM jk blog: "Autonomic planet: distributed intelligence for a self-healing ecosystem"

RT @kexpplaylist #kexp So Says I by The Shins from Chutes Too Narrow JK--2003. Love this song. Have never understood the Sir Thomas More ref

Workday stress relief involves clearing my workspace of tasks. Do it, lose it. Like training with superheavy weights. Burdens test the heart

"ROI of Data Governance" (http://ow.ly/sZgVU ) JK--Good quantifiable metrics. What's a "tranche"? Don't get all French on us suddenly.

"Why video is next big thing in #bigdata" (http://ow.ly/sZfwt ) JK--Some #IBM brainiac called this "Big Media" (http://ow.ly/sZfO7 )

"#BigData: Retailers, Supermkts, Med Mkts All Dive In 2 Extract Info From & About Consumers" (http://ow.ly/sZdAz ) JK--US lagging Europe?

Columnist (Net Wd, mostly), analyst (various), evangelist (IBM): that's the progression of my career as an IT industry opinion-slinger. Hmm.

"Social Media Data, Sensor Data, Open Data can make..." (http://ow.ly/sZcty ) JK--Discusses #IBM & other #SmarterCities initiatives

"Using #BigData to Ask Big Questions" (http://ow.ly/sZbPd ) JK--Explore larger issues whose data-centric answers not evident 2 naked eye

"Wht Hse study #bigdata. 5 things it should know" (http://ow.ly/sZatl ) JK--"3. Big means nothing." Bingo! Privacy vulnerable @ any scale

"White House Launches #BigData, Privacy Review" (http://ow.ly/sZ9Ya ) JK--Needs to engage UN, other nations in the discussions.

Morning's overstuffed inbox. Are newly received emails the crumbs U sweep off yr desk 2 start your day? Or are they the meat of what awaits?

Mornings are just matter of waking & slapping yourself back to the land of the living. Quotidian rebirthing. Hopefully absent the screaming

Morning is gray and freezing. Oh...it's January. I'll keep my expectations in check. It's a perfectly seasonable day, then.

RT @kexpplaylist #kexp Lost Boys & Lost Girls Club by Dum Dum Girls from Too True JK--2014. Excellent new song. They do Letterman this week

RT @kexpplaylist #kexp Bulldozer Love by Blank Realm from Grassed In JK--Cool. Sounds sorta like Tom Verlaine fronting Dandy Warhols

Smarter planet? Intelligence for a self-healing landscape p1: http://ow.ly/sYZdY  p2: http://ow.ly/sYZhH  Monday #IBM quick-hit

RT @kexpplaylist #kexp Salvador Sanchez by Sun Kil Moon from Ghosts of th Great Highway JK--2003. Brilliant grinding drone. Achieves majesty

Saw 2001 "Doc Martin" film that preceded TV show. Essentially, diff character, slightly diff name, same actor. Nicer. Prefer the curmudgeon

Well, that's two French presidents in a row who've split with their significant other mid-term. Surprised it didn't happen to the Clintons.

Another Sunday yoga teaches comes and goes, due to conflicting life priorities. What? This isn't their whole life? I'm shocked.

Met a longtime #IBM-er who was born & raised in Endicott NY. Sorta like meeting a Hershey Foods employee from Hershey PA. True Blue!

When some Facebook newbie posts in ALLCAPS, I'm wondering if it's from a Wang wordprocessor circa 1978.

Some called Newton's gravity "spooky action at a distance." You can think of quantum mechanics as spooky action close up.

Danny Kaye. "CBS Sunday Morning" profiling this all-around superb entertainer. Even his final TV performance ("Cosby Show") was memorable.

The news that IBM is disposing of its x86 server business to Lenovo for about $2.3 billion comes as no shock. http://bit.ly/1fe4pXY 

Excited for tomorrow! Emceeing at #IBMConnect and get to introduce legendary @geoffreyamoore and the Queen of #socbiz @Sandy_Carter

Fingering my stubble. Sunday's one of my shaving days. Can't start the week without mowing the facial lawn.

Planning stuff I want to accomplish this year, work-wise. Hmm. Unusual for me over Sunday breakfast. Normally I drift at this time.

RT @shawnrog: 5 Laws of #BigData Startups - Analyst rant on Big Data startup briefings & strategy http://ow.ly/sNiKD  #bigdata #startup

RT @TheDailyShow: Group of world's wealthiest people gather in secluded mountain enclave 2 discuss income inequality. http://on.cc.com/1jtrVDD 

The only Grammy contest that I have an opinion on this year is that Macklemore & Ryan Lewis should get Song Of The Year for "Same Love."

New jk #IBMDataMag article: " When and When Not to Have Faith in Statistical Models – Part 1" (http://ibmdatamag.com/2014/01/when-and-when-not-to-have-faith-in-statistical-models-part-1/ …)

WSJ review of WSBurroughs bio. I saw him onstage in Madison in 1984. Weird creepy unhinged incoherent ranting maniac. And that's being kind.

Can we drop the "also known as Lou Gehrig's disease"? This ain't 1941. By this time, every1 knows it's called amyotrophic lateral sclerosis.

"Stephen Hawking Proposes Radical New Theory of Black Holes" (http://ow.ly/sVBeV ) JK--Link is abstract of his paper awaiting peer review


My primary objective in consolidating many contributors' inputs to a document is to guard against broth spoilage. I keep tasting the soup.

Friday, January 24, 2014

Aweekstweets January 18-24 2014: the week it stayed subfreezing

"#Hadoop 2.0's deep impact on #bigdata techs" (http://ow.ly/sVwCN ) JK--Nicole Laskowski rounds up the analysts

"#BigData Debate: Will HBase Dominate NoSQL?" (http://ow.ly/sVwhV ) JK--Will? It's not dominating it now. Debate premise is a bit off.

Catch me #IBM @fhalper #TDWI Krish Krishan, et al on #BigData webinar on Tues Feb 11, 12-1pm EST (http://ow.ly/sVtSD ).

Video: The Retail Equation predicts and shapes shopper behavior with #IBM #PureData for Analytics (http://ow.ly/sVsRF )

Stupidest LinkedIn message is "a LinkedIn member viewed your profile." Exactly what am I going to do with that information?

U can tell IoT is hot by the growing surge of cautionary pushback, eg http://ow.ly/sVfft  http://ow.ly/sVfjy  http://ow.ly/sVflM 

#CXO is back Mon 12 ET! #Data 4 #CX Advantage w/ @StephanieThum, @WilliamMcKnight y yo Señor Diego Kobi http://on.fb.me/1dTdC9e  #bigdata

Occasionally a typo coins a gr8 new word. Discussing historical archives, I was thinking of "yesterday's data" & it came out as "yesterdata"

"How Machine Learning Could Result In Great Applications for Your Business" (http://ow.ly/sUwZu ) JK--Good discussion of specific apps.


Meaty metadata? Data variety leads to metadata viscosity p1: http://ow.ly/sUvAo  http://ow.ly/sUvBW  Friday #IBM quick-hit

RT @kexpplaylist #kexp Tomorrow is a Long Time by Phosphorescent from Sweetheart 2014 JK--A fine cover of a simple soulful Dylan classic.

"Infinity AR: We'll fulfill sci-fi promise of augmented reality" (http://ow.ly/sUret ) JK--Curious why they demo privacy-invading AR app

Ejected DVD of Leonardo Di Caprio "The Great Gatsby" a half-hour in. Hyperstylized junk. Hiphop in soundtrack of the Roaring '20s? WTF?!

RT @caro: I'd be OK if law pardoned Rob Ford & Justin Bieber on the condition that they recorded duet of "Paradise by the Dashboard Light."

Have drafted 4 of next week's 5 quick-hits. Now I'll breathe and knock off for the evening.

"Big Data & Clinicians: Review on State of Science" (http://ow.ly/sSVmo ) JK--In-depth discussion by Stanford School of Medicine rsrchers

"Framework to build logistic regress model in rare event population" (http://ow.ly/sSULH ) JK--Interesting step-by-step from data scntist

Responded to email interview on #bigdata by ODBMS Industry Watch.

RT @kexpplaylist #kexp You Are the Generation That Bought More Shoes & You Get What You Deserve by Johnny Boy JK--2006. Gr8 song. Long title

Drafted latest #IBM AnalyzingMedia blog: "Pay Your Customers What They're Worth to You"

"How Government Can Make Open Data Work" (http://ow.ly/sSt4e ) JK--Good detailed practical ideas.

"The internet of things needs a new security model. Which one will win?" (http://ow.ly/sSsyl ) JK--It doesn't have an old model. Up4grabs

New #IBM jk blog: "#BigData overkill can stunt scientific rigor" (http://ow.ly/sS2pq )

"What’s the Lift of Your Churn Model?" (http://ow.ly/sRW3v ) JK--Popular pickup line in bars frequented by data scientists.

"Three myths about data scientists and #bigdata" (http://ow.ly/sRV9Z ) JK--New word (new to me at least): "polyvalent." Means "versatile"

RT @kexpplaylist #kexp Dreams by The Electric Peanut Butter Company from Trans Atlantic Psych Classics Vol. 2 JK--2013. Fleetwood Mac cover

"Lenovo Plans 2 Acq #IBM x86 Svr Biz" (http://ow.ly/sRNKJ ) JK--IBM keeps Sys z, Powr Sys, Stor Sys, Pwr-bsd Flex svrs, PureApp, PureData

Context accumulation? Narratives drive home relevance of stat models p1: http://ow.ly/sRNmb  p2: http://ow.ly/sRNnq  Thurs #IBM qh

In the Middle Ages the English referred to the Black Death as the "Great Death." I find that even more chilling.

RT @kexpplaylist #kexp Natural One by Shearwater from Fellow Travelers JK--2013. Great cover of Folk Implosion's 1995 original.

"Apache Spark: The Next #BigData Thing?" (http://ow.ly/sQwdi ) JK--I'm most interested in its streaming model. See mid-article.

"The future of storage: disk-based or just discombobulated?" (http://ow.ly/sQw20 ) JK--Discusses recent M&A in the storage market.

"New Techniques Detect Anomalies in #BigData" (http://ow.ly/sQvHL ) JK--Discusses anomaly detection relevant 2 IoT/machine data analytics

"Can #BigData & SQL Get Along?" (http://ow.ly/sQvly ) JK--Discusses pros and cons of SQL in a NoSQL environment.

Lookee here: 15 balmy degrees in the middle of the afternoon. Well, at least it's sunny. Cold consolation.

New jk #IBM #Dataversity article: "The Beauty Metric: Choosing the Best-Fit Advanced Analytic Algorithms" (http://ow.ly/sQl18 )

A8: u don't need a schema 2 enforce security, u need a better security tool that monitors & secures Hadoop/NoSQL #bigdatamgmt
 Retweeted by jameskobielus

A8: #bigdatamgmt You can have flexibility 2 use any #bigdata platform U wish. As long as plats individually & jointly are secure

#BigDataMgmt A8: #Hadoop is not the issue. The lack of a holistic approach to #BigData #Security & #Privacy is. End the Patchwork!
 Retweeted by jameskobielus

A8 Security tech needs 2 adapt 2Hadoop/NoSQL - not the other way round. #bigdatamgmt
 Retweeted by jameskobielus

A8: @merv suggests AAAA approach: authentication, authorization, audit, anonymization http://gtnr.it/1kV9bi6  #bigdatamgmt
 Retweeted by jameskobielus

A8: #bigdatamgmt Sure. But that demands that #privacy #security be addressed ALL the time into ALL planning and ops for ALL #bigdata

+1 RT @DCorrigan: A7: Also need to identify sensitive data, and know it's lineage and what it's used for #bigdatamgmt
 Retweeted by jameskobielus

#bigdatamgmt A7 Centralization => Risk of bigger failures, bigger bullseye on back
 Retweeted by jameskobielus

A7: #bigdatamgmt If #Hadoop deployed for ETL into DW augmentation role, & if DW holds PII, Hdp ops must also protect #privacy

A7: #bigdatamgmt If data in #Hadoop clusters not PII, I have no concerns. If it's PII, is there implied consent (eg social media data)?

#BigDataMgmt: A7: #Hadoop was not Designed for #Privacy. Vendors commercializing this IP must address it. #BigData will #fail without it.
 Retweeted by jameskobielus

A7: Partly No, Appropriate configurations & Apps done right could provide enterprise grade security in #hadoop #bigdatamgmt
 Retweeted by jameskobielus

A7: #bigdatamgmt My concern is not with security of #Hadoop tech. It's with business' lack of data stewardship controls around Hdp data

A6: #bigdatamgmt Sheer proliferation of DBs containing PII is one of the biggest barriers to #privacy compliance. Find it all first.

A5: #bigdatamgmt The bigger the business is, the more likely to be target of attack. But zombies target businesses at all scales.

A6: #bigdatamgmt Lack of unified PII governance under #MDM is big barriers to #privacy compliance. PII data not yet harmonized

A6: #bigdatamgmt Failure of a company to have chief privacy officer or privacy data stewards is a tech barrier. Who's responsible

A6 - fragmented security technologies 4 each data platform. U need 1 security technology 2 monitor data in Hadoop, RDBMS, NoSQL #bigdatamgmt
 Retweeted by jameskobielus

A6: Different IT infrastructure & associated data policy makes it harder to achieve privacy compliance #bigdatamgmt
 Retweeted by jameskobielus

A5 Orgs need to work on assumption they've already been breached. Korea = inside job; Target unnoticed for too long, etc. #bigdatamgmt
 Retweeted by jameskobielus

A5: Biggest data security threats come from inside, report says http://qub.me/hcuplY  #bigdatamgmt
 Retweeted by jameskobielus

A4: Remember that our brain wiring fights us here. I wrote chapter on this here http://bit.ly/1cVtGBh  #bigdatamgmt #bigdata, #privacy
 Retweeted by jameskobielus

A5: Accidental disclosure tops hackers! #Forrester says 36% of breaches are internal http://bit.ly/1kVek9P  #bigdatamgmt
 Retweeted by jameskobielus

A5: #bigdatamgmt Accidental disclosure is always a threat. The more complex & dynamic a biz, more likely someone is to slip up

A5: #bigdatamgmt Cluelessness. It's too easy for biz to take for granted data they hold & misuse it without awareness of consequences

A4 - The punishment will be a silent killer - how many customers avoided you because of privacy issues? #bigdatamgmt
 Retweeted by jameskobielus

A4: Using the power you hold as a consumer is the best way. Example: Target, you won't find me shopping there any time soon. #bigdatamgmt
 Retweeted by jameskobielus

A4: #bigdatamgmt Shaming is a punishment in a reputation-sensitive economy. May not be easy to quantify the bottomline impact, though

A4: Target had to make an official statement, as quarter earnings materially affected. #bigdatamgmt
 Retweeted by jameskobielus

@IBMbigdata: A4: 2013 Ponemon Institute Data Breach Study says avg cost of a breach to a US org is $5.4M. #bigdatamgmt
 Retweeted by jameskobielus

RT @IBMbigdata A4: Here’s link to Ponemon Cost of Data Breach Study http://bit.ly/1kVcSEE  #bigdatamgmt
 Retweeted by jameskobielus

A4: They lose #trust, loyalty, and instill fear in the consumer. #bigdatamgmt Still afraid to shop at #Target...
 Retweeted by jameskobielus

A4: #bigdatamgmt If by "punish" you're referring to expense of privacy litigation, damage control, & retooling, many biz have suffered

A4: #bigdatamgmt Good question. Has any privacy-deaf business ever suffered significant customer churn? I haven't heard of any.

A3: #bigdatamgmt #Privacy is #CX concern just as much as a #legal concern. You don't want to creep out customers w/ privacy invasions

A3: Orgs should rely on policy & law as a MINIMUM! Great orgs go beyond. #bigdatamgmt
 Retweeted by jameskobielus

A3 No! Work with customers to set reasonable policy, limits, uses. And be transparent. #bigdatamgmt
 Retweeted by jameskobielus

A3: #bigdatamgmt If you stay in customer comfort zone #privacy, you're mitigating risk of lawsuits. Be more stringent than law requires

A3: #bigdatamgmt Your org should consider customer "experience" sensitivities, PLUS law/regulations, in setting #privacy practices.

A2: #bigdatamgmt IMHO, users in online society should own/withhold some PII. It's a monetizable personal resource.

Individuals own their Digital Identities, but must be ever-vigilant in managing and protecting them to maintain any #Privacy #bigdatamgmt
 Retweeted by jameskobielus

RT @DCorrigan: a2 things are changing in EU - consumers own data, and have right to have it deleted. http://wp.me/2kNLp  #bigdatamgmt
 Retweeted by jameskobielus

A2: #bigdatamgmt If consider who SHOULD own PII, some feel individual should. But when gen'd in transactions, shouldnt biz share ownership?

A2: #bigdatamgmt The matter of data ownership is also obviously jurisdiction-specific. But laws may be vague in some places. Risks abound

A1: #bigdatamgmt The notion of "implied consent" must be considered on some types of data. HIre a good lawyer to advise you.

A1: #bigdatamgmt If the data you're collecting is NOT PII, there may still be relevant restrictions (eg regarding transborder data flows).

A1: #bigdatamgmt If the data is personally identifiable info, you need to consult relevant laws before collecting, managing, or using

Agreed! RT @IBMAnalytics: Grt point RT @Natasha_D_G A1: Social data = public data and thus orgs can collect at will #bigdatamgmt #privacy
 Retweeted by jameskobielus

A1: Open public data and data with waived rights (certain #CreativeCommon ) should float without consent #NotRelevantToNSA #bigdatamgmt
 Retweeted by jameskobielus

A1: #bigdatamgmt Depending on laws of the jurisdiction you operate in. The safe answer is "never." Get consent 1st before you collect PII

@IBMbigdata As always, I stand ready to tweet with all the pretentious but concise erudition that I can muster.


Maximize the value of your Systems of Engagement w/ IBM's Engagement #Analytics, http://www.ibm.com/engage  #bigdata #socbiz #ibmconnect

Apple's stupid "Lemmings" commercial from the 1985 Super Bowl (http://ow.ly/sQ59j ). Did Disney really license "Heigh Ho" for this?


Whoa! I just found a legitimate reason to reference Schopenhauer in a blog. All my smartypants pretentious erudition rising to the surface.

Ah yes, coordinating #IBM-wide response 2 an analyst firm. Not dissimilar 2 coordin8ng analyst-firm-wide respns 2 #IBM, which I've also done

RT @kexpplaylist #kexp A New England by Billy Bragg from the Peel Sessions JK--1991. This is his rawest and best version of this great song

NoSQL? The architecture that's still curiously absent p1: http://ow.ly/sPAYH  p2: http://ow.ly/sPB1c  Wednesday #IBM quick-hit

Hard to take an interest in the upcoming Winter Olympics. An armed athletic encampment under the control of homophobes? Go to hell, Putin!

Aging in your career. You know it's happening when people (and you yourself) routinely refer to you as a "veteran." Yikes!

Ever look at a newly snowy landscape and immediately lose memory of how it appeared pre-snow? You wonder if it's the same place underneath.

Crazy-busy day today. But made measurable progress on everything. That's all I seek. Now: freshfallen snow to remove.

"3 innovs 2 xform mob data analytx" (http://ow.ly/sNW09 ) JK--"Once bridg digital + phys, no way restrict possibs o val data & event gen"

"The number crunch: Will Big Data transform your life?" (http://ow.ly/sNVtL ) JK--Closes w/discussion of #IBM "computational creativity"

Drafted next #IBMDataMag article: "Truth, Beauty, Authority, and Efficiency: When to Place Faith in Statistical Models (and When Not)"

"Will these tiny computers herald the arrival of Internet of Things?" (http://ow.ly/sNDiH ) JK--Teensy-weensy platforms for thing devel.

"DIY Internet of things: Ultimate maker project" (http://ow.ly/sNCEG ) JK--"Homebrew" thing development? Who'll be the Jobs/Woz of IoT?

"2014: An Internet of Things Odyssey" (http://ow.ly/sNBr2 ) JK--Fun fact: the monolith was God's lost smartphone calling home.

Ze snow she is falling!

RT @kexpplaylist #kexp Transmission by Joy Division frm Substance JK--"Radio live transmission"? No. Studio recrdng. Duh. Dance dance dance!

WashPost article about SilVal firms decamping 4 DC area. Focuses on wireless vendor. Yeah we have that depth. I was w/ WL tool vendor in 90s

Trying to think of a cute hype-y nickname for the coming snowstorm. Hmm. BlizzKrieg. Yeah. That's the ticket!

RT @kexpplaylist #kexp Blue Moon by Beck from Morning Phase JK--2014. Wow! Fresh Beck! And it's good stuff (naturally).

Waiting for the snow to begin falling. It won't be proverbial "snow day" for me. I still work--and doubly so--i.e., my job + the shoveling

RT @kexpplaylist #kexp Sleeping Gas by The Teardrop Explodes from Kilimanjaro JK--1980. Liverpool/Julian Cope. Similar 2 Echo & The Bunnymen

RT @kexpplaylist #kexp Silver Timothy by Damien Jurado from Brothers & Sisters of Eternal Son JK--2014. Odd, endearing voice like Neil Young

Security of #bigdata? Shoddy lifecycle management is ironic data security p1: http://ow.ly/sN41i  p2: http://ow.ly/sN433  Tues #IBM qh

Mahinder told the party last night that he was impressed with my prior knowledge of Turkey's ancient history. Yes, I'm a total history geek.

"Internet of Things: Do Customers Have Say in Who Owns Their Data?" (http://ow.ly/sLWOF ) JK--Misleadingly implies B2C is only IoT domain

Right now, Weather Channel says 57 ° in NoVa. Tomorrow they predict 5-7 inches of snow. I'm sure the 57-5-7 stat correlation is significant

When some1 sends me list of tech Qs outa nowhere, saying "I look forward to yr response," they can look forward to eternity. Wot in it 4 me?

"Making sense o social nets, social graphs, collab, & cooperation in new way o work" (http://ow.ly/sLRIb ) JK--"Leanership"? Oh, brother!

Saw DVD of 1999 film "Angela's Ashes" about desperately poor Irish people pining for some magical fantastical wonderland called "America."

The Box Tops "Cry Like a Baby" (http://ow.ly/sLGQs ) JK--Recorded in Memphis. From Memphis. #2 on charts on 4-4-68.

Louis Armstrong "Ain't Misbehavin'" (http://ow.ly/sLATP ) JK--One of biggest hits the year MLK Jr was born. 1929.

Otis Redding "Sitting on th Dock of Th Bay" (http://ow.ly/sLzRV ) JK--#1 song on 4-4-68. Died 10-10-67. Plane crash Lk Mendota Madison WI

10,000 Maniacs "A Campfire Song" (http://ow.ly/sLpIa ) JK--1987. Cool duet of Natalie Merchant & Michael Stipe

REM "Begin the Begin" (http://ow.ly/sLk9e ) JK--"Let's begin again, like Martin Luther Zen." Gr8 nonsense. I met these guys in Madison.

Drafted my latest #IBM #Dataversity blog: "The Beauty Metric: Choosing the Best-Fit Advanced Analytic Algorithms"

Drafted latest #IBM blog: "#BigData Overkill Can Stunt Scientific Rigor"

RT @kexpplaylist #kexp After the Disco by Broken Bells from After th Disco JK--2014. JMercer boasts 2 awesome & distinct groups (this+Shins)

"What analytic managers can learn from neural nets" (http://ow.ly/sL8ZG ) JK--"Neural mgmt....all data scientists perform all functions"

"Tim O'Reilly on open data" (http://ow.ly/sL8t4 ) JK--"When cost is low enough, it ...create[s] many of the same conditions as a commons”

"Human Analysts at Superhuman Scales" (http://ow.ly/sL83u ) JK--Discusses their graph analytics programming framework for genomics

"PredMod w/#BigData: Bigger Really Better?" (http://ow.ly/sL7lb ) JK--“Crtn telling behavs may not B obsrvd in suff #s w/out massiv data”

"Markov Logic Networks" (http://ow.ly/sL6qe ) JK--"Situations where outcomes partly random and partly under control of a decision maker"

"How NLP Makes Lives Easier (http://ow.ly/sL1K8 ) JK--"Ultimate goal o NLP is 'do away with computer programming languages altogether'"

Advanced visualization? Using visuals to discriminate among machine-learning algorithms (http://ow.ly/sKWfr ) Monday #IBM quick-hit

RT @kexpplaylist #kexp Red Eyes by The War on Drugs from Lost in the Dream JK--2014. These guys are good. Real solid rock with depth.

Weirdly, my 1st (crappy) job out of grad school gave me a useful skill. Reader in Wisconsin Newspaper Assn clipping bureau. Speed-gleaning.

Dinner party last night w/friends Mahinder & Harban who we met on Turkey trip last month. Reminisced about recent memories of ancient ruins

Saw article about how data, supposedly, is "evil." Article discussed how practices (collecting+using data) might qualify. Data itself? No

Congrats to Seattle and Denver. Super Bowl parties in both cities will be historic. Legal doobies will be lit to celebrate.

OK you numchuk ex-jock broadcasters: testosterone-fueled shouting of football cliches & truisms is NOT the same as analysis.

Sandy aid. Was it Christie's slush fund to reward NJ mayors who supported his re-election bid & punish those who didn't?

"Punk"? "Post-punk"? For the sake of exhaustive categorization, I'm going to insist that we label all rock before 1976 as "pre-punk."

"CBS Sunday Morning" on autonomous vehicles. These will offer unparalleled mobility for the disabled. Wheelchairs will dock inside.

"CBS Sunday Morning" examination of doodling. It's just a tactic to stave off boredom inflicted by people telling you nothing new or useful.

Reading Nicholas Ostler's excellent "Empire of the Word: A Language History of the World." Especiialy enjoy his theory of language spread


I love the elegance of Windows' design, best exemplified by the command "Force Shutdown."