On 22/05/2018 1:42 AM, Kellie Fitton wrote:
Hi Folks.
I cobbled together a program to perform data mining on a large
collection of sizable ISAM files. The algorithms will mine the
warehoused data for relevant statistics, and generate predictive
analytics to guide management decisions and measure performance.
However, extracting real meaning from data can be challenging,
fiendishly complex to understand, and wildly counterintuitive.
Major factors to consider: bad data, flawed processes, and
misinterpretation of results can produce false positives and
negatives, which can lead to inaccurate conclusions and
injudicious business decisions. I would like to know your
professional opinions on the following questions:
1). In the social sciences, is it practical or useful to develop
a predictive model?
Yes. BUT, there are some provisos...:
(These provisos apply to changing the alpha factor on an ancient
Inventory Control system to reflect seasonal demand fluctuations, constructing a deep neural network for a specific AI application and
loading 10s of millions of data points into it, or to a simple self-modifying heuristic programming example; in other words, ANY kind
of software where the "results" are predicated on previous results and modified within desirable constraints:)
1. The predictions (no matter what the algorithm claims) should be considered to be accurate to within 50%. In other words, the model can
be used to give a "general likelihood" of what is going to happen.
2. No financial risk of any kind must be taken, based on the prediction.
3. The rules above don't change if the model is within 10% (unless
it is run across at least 1000 datasets and ALWAYS predicts within 10%
of the actual outcome). In other words, the "credibility" of the model
may improve but that doesn't alter the rules for using it.
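
To make the bar in rule 3 concrete, here is a minimal sketch in Python. The function names, the (predicted, actual) pair layout, and the use of relative error as the measure of "within 10%" are all assumptions made for illustration; none of them come from the original program.

```python
def relative_error(predicted, actual):
    """Absolute error as a fraction of the actual outcome."""
    if actual == 0:
        raise ValueError("relative error is undefined when the actual outcome is 0")
    return abs(predicted - actual) / abs(actual)


def meets_credibility_bar(results, tolerance=0.10, min_datasets=1000):
    """True only if enough datasets were run AND every prediction is within tolerance.

    results: iterable of (predicted, actual) pairs, one pair per dataset
    (a hypothetical layout chosen for this sketch).
    """
    results = list(results)
    if len(results) < min_datasets:
        return False  # too few runs to say anything, however good they look
    return all(relative_error(p, a) <= tolerance for p, a in results)
```

A single miss, or a single missing dataset, fails the check. Even a pass only speaks to the model's "credibility"; it does not lift rules 1 and 2.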
2). Are there any ironclad guarantees around predictive models?
No.
But that doesn't mean they are worthless.
Some classes of problem can ONLY be solved by a computer using
heuristics or AI, because solving them with traditional methods
would take longer than the time available.
If a heuristic model finds its way through a complex maze, the solution
may not be the BEST one, but it is better than NO solution.
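
The maze point can be illustrated with a small Python sketch. A plain depth-first search stands in here for the "heuristic" (it takes the first route it stumbles on), and a breadth-first search supplies the shortest route for comparison. The grid and all names are hypothetical, chosen only for this example.

```python
from collections import deque

# Hypothetical maze: "S" start, "G" goal, "." open cell, "#" wall.
GRID = ["S..",
        "...",
        "..G"]


def find(ch):
    """Return the (row, col) of the first cell containing ch."""
    for r, row in enumerate(GRID):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(f"{ch!r} not in grid")


def neighbours(cell):
    """Yield the open cells directly right, below, left and above."""
    r, c = cell
    for dr, dc in ((0, 1), (1, 0), (0, -1), (-1, 0)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(GRID) and 0 <= nc < len(GRID[0]) and GRID[nr][nc] != "#":
            yield (nr, nc)


def search(start, goal, depth_first):
    """Depth-first: any route. Breadth-first: a shortest route."""
    frontier = deque([start])
    came_from = {start: None}
    while frontier:
        cell = frontier.pop() if depth_first else frontier.popleft()
        if cell == goal:
            path = []
            while cell is not None:  # walk back to the start
                path.append(cell)
                cell = came_from[cell]
            return path[::-1]
        for nxt in neighbours(cell):
            if nxt not in came_from:
                came_from[nxt] = cell
                frontier.append(nxt)
    return None  # no route at all


start, goal = find("S"), find("G")
quick = search(start, goal, depth_first=True)   # a solution
best = search(start, goal, depth_first=False)   # a shortest solution
print(len(quick), len(best))
```

The depth-first route is never shorter than the breadth-first one, but it is still a usable way out of the maze.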
If an AI net predicts cases of cholera within 5 miles of your location,
you might well laugh it off but you'd probably renew your vaccination,
just in case...
This whole field is expanding rapidly and it is likely that much more reliable predictions will be available within the next few years. It
might then become possible to relax rules 1 and 2, but for now you
should treat the output from a predictive model with extreme skepticism, even when it gets it pretty much right...
It's like the "Pirate Rules" in Pirates of the Caribbean; more of a "guideline", really.
Pete.
--
I used to write COBOL; now I can do anything...
Hi Folks.
I cobbled together a program to perform data mining on a large
collection of sizable ISAM files. The algorithms will mine the
warehoused data for relevant statistics, and generate predictive
analytics to guide management decisions and measure performance.
However, extracting real meaning from data can be challenging,
fiendishly complex to understand, and wildly counterintuitive.
Major factors to consider: bad data, flawed processes, and
misinterpretation of results can produce false positives and
negatives, which can lead to inaccurate conclusions and
injudicious business decisions. I would like to know your
professional opinions on the following questions:
1). In the social sciences, is it practical or useful to develop
a predictive model?
2). Are there any ironclad guarantees around predictive models?
Thank you for your feedback.
Hi Doc Trins O'Grace,
I appreciate your informative feedback. I interviewed some
of my friends who are working as engineers and their answers
rather surprised me. They said that to increase the accuracy
and clairvoyant logic of their predictive analytics, they must
leverage the power of a new wrinkle in their field: the
reliance on Machine Learning and AI (Artificial
Intelligence).
I find it shocking that the most sophisticated
predictive software can become fully non-predictive in just
a two-week period, due to the complexity, uncertainty and
unpredictability of our connected world. Case in point: the
collapse of financial services firm Lehman Brothers and the
Great Recession of 2007, which were not predicted by economists
who are trained to forecast and use predictive analysis.
Hi Doc Trins O'Grace,
You are absolutely right -- the trouble is knowing your inputs.
I must ask myself whether the data are good, make sure that the
data used, as well as the processes that generate and organize
them, are of the highest quality, and fully understand them. I
don't want to spend a long time and resources only to find a bug
in the data.
One can still get problems even with the best data. Garbage in,
garbage out. Predictive analytics are risky by nature, they are
valid as long as the input data are also valid.
On 23/05/2018 8:07 AM, Kellie Fitton wrote:
One can still get problems even with the best data. Garbage in,
garbage out. Predictive analytics are risky by nature, they are
valid as long as the input data are also valid.
Sadly, no, they are not.
In article <[email protected]>,
pete dashwood <[email protected]> wrote:
On 23/05/2018 8:07 AM, Kellie Fitton wrote:
[snip]
One can still get problems even with the best data. Garbage in,
garbage out. Predictive analytics are risky by nature, they are
valid as long as the input data are also valid.
Sadly, no, they are not.
Mr Dashwood, it seems that folks no longer study the Hawthorne effect.
DD
On 25/05/2018 6:12 AM, [email protected] wrote:
In article <[email protected]>,
pete dashwood <[email protected]> wrote:
On 23/05/2018 8:07 AM, Kellie Fitton wrote:
[snip]
One can still get problems even with the best data. Garbage in,
garbage out. Predictive analytics are risky by nature, they are
valid as long as the input data are also valid.
Sadly, no, they are not.
Mr Dashwood, it seems that folks no longer study the Hawthorne effect.
Sorry Doc, not sure of your allusion here. My position has been
consistent throughout the thread (whether it was observed or not... :-)):
"Don't trust the results of analytics."