Deleting missing values from a dataset












2












$begingroup$


I wanted to create a dataset of all UFO sightings in April of 2018. The
AssociationThread doesn't want to work, because some of the entries are missing value, what is the best way to fix it?



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"Data"}];
dataA2 = Flatten[Rest@dataA1, 1];
dataA3 =
Map[AssociationThread[First[dataA1], #] &, dataA2]
Dataset[dataA3]









share|improve this question











$endgroup$








  • 1




    $begingroup$
    To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
    $endgroup$
    – Carl Lange
    6 hours ago
















2












$begingroup$


I wanted to create a dataset of all UFO sightings in April of 2018. The
AssociationThread doesn't want to work, because some of the entries are missing value, what is the best way to fix it?



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"Data"}];
dataA2 = Flatten[Rest@dataA1, 1];
dataA3 =
Map[AssociationThread[First[dataA1], #] &, dataA2]
Dataset[dataA3]









share|improve this question











$endgroup$








  • 1




    $begingroup$
    To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
    $endgroup$
    – Carl Lange
    6 hours ago














2












2








2





$begingroup$


I wanted to create a dataset of all UFO sightings in April of 2018. The
AssociationThread doesn't want to work, because some of the entries are missing value, what is the best way to fix it?



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"Data"}];
dataA2 = Flatten[Rest@dataA1, 1];
dataA3 =
Map[AssociationThread[First[dataA1], #] &, dataA2]
Dataset[dataA3]









share|improve this question











$endgroup$




I wanted to create a dataset of all UFO sightings in April of 2018. The
AssociationThread doesn't want to work, because some of the entries are missing value, what is the best way to fix it?



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"Data"}];
dataA2 = Flatten[Rest@dataA1, 1];
dataA3 =
Map[AssociationThread[First[dataA1], #] &, dataA2]
Dataset[dataA3]






import dataset associations web-access






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited 4 hours ago









Carl Lange

4,61311038




4,61311038










asked 6 hours ago









Artem AnisimovArtem Anisimov

403




403








  • 1




    $begingroup$
    To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
    $endgroup$
    – Carl Lange
    6 hours ago














  • 1




    $begingroup$
    To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
    $endgroup$
    – Carl Lange
    6 hours ago








1




1




$begingroup$
To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
$endgroup$
– Carl Lange
6 hours ago




$begingroup$
To be clear, this isn't about data that is Missing, but rather Import[XXXX,{"HTML", "Data"}] giving "tables" that have rows with inconsistent lengths.
$endgroup$
– Carl Lange
6 hours ago










1 Answer
1






active

oldest

votes


















6












$begingroup$

You can solve this by importing "FullData" rather than "Data".



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"FullData"}]
dataA2 = Flatten[Most@Rest@dataA1[[8]], 1]
dataA3 = Map[AssociationThread[dataA1[[8, 1, 1]], #] &, dataA2];
Dataset[dataA3]


What's awkward about this is that there can be many empty tables that you must sift through (as you can see, I have to get the 8th element). However, it works quite well in this case.






share|improve this answer









$endgroup$













  • $begingroup$
    Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
    $endgroup$
    – Artem Anisimov
    5 hours ago






  • 2




    $begingroup$
    @ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
    $endgroup$
    – Carl Lange
    5 hours ago











Your Answer





StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "387"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmathematica.stackexchange.com%2fquestions%2f193199%2fdeleting-missing-values-from-a-dataset%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









6












$begingroup$

You can solve this by importing "FullData" rather than "Data".



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"FullData"}]
dataA2 = Flatten[Most@Rest@dataA1[[8]], 1]
dataA3 = Map[AssociationThread[dataA1[[8, 1, 1]], #] &, dataA2];
Dataset[dataA3]


What's awkward about this is that there can be many empty tables that you must sift through (as you can see, I have to get the 8th element). However, it works quite well in this case.






share|improve this answer









$endgroup$













  • $begingroup$
    Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
    $endgroup$
    – Artem Anisimov
    5 hours ago






  • 2




    $begingroup$
    @ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
    $endgroup$
    – Carl Lange
    5 hours ago
















6












$begingroup$

You can solve this by importing "FullData" rather than "Data".



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"FullData"}]
dataA2 = Flatten[Most@Rest@dataA1[[8]], 1]
dataA3 = Map[AssociationThread[dataA1[[8, 1, 1]], #] &, dataA2];
Dataset[dataA3]


What's awkward about this is that there can be many empty tables that you must sift through (as you can see, I have to get the 8th element). However, it works quite well in this case.






share|improve this answer









$endgroup$













  • $begingroup$
    Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
    $endgroup$
    – Artem Anisimov
    5 hours ago






  • 2




    $begingroup$
    @ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
    $endgroup$
    – Carl Lange
    5 hours ago














6












6








6





$begingroup$

You can solve this by importing "FullData" rather than "Data".



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"FullData"}]
dataA2 = Flatten[Most@Rest@dataA1[[8]], 1]
dataA3 = Map[AssociationThread[dataA1[[8, 1, 1]], #] &, dataA2];
Dataset[dataA3]


What's awkward about this is that there can be many empty tables that you must sift through (as you can see, I have to get the 8th element). However, it works quite well in this case.






share|improve this answer









$endgroup$



You can solve this by importing "FullData" rather than "Data".



dataA1 = Import[
"http://www.nuforc.org/webreports/ndxe201804.html", {"HTML",
"FullData"}]
dataA2 = Flatten[Most@Rest@dataA1[[8]], 1]
dataA3 = Map[AssociationThread[dataA1[[8, 1, 1]], #] &, dataA2];
Dataset[dataA3]


What's awkward about this is that there can be many empty tables that you must sift through (as you can see, I have to get the 8th element). However, it works quite well in this case.







share|improve this answer












share|improve this answer



share|improve this answer










answered 6 hours ago









Carl LangeCarl Lange

4,61311038




4,61311038












  • $begingroup$
    Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
    $endgroup$
    – Artem Anisimov
    5 hours ago






  • 2




    $begingroup$
    @ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
    $endgroup$
    – Carl Lange
    5 hours ago


















  • $begingroup$
    Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
    $endgroup$
    – Artem Anisimov
    5 hours ago






  • 2




    $begingroup$
    @ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
    $endgroup$
    – Carl Lange
    5 hours ago
















$begingroup$
Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
$endgroup$
– Artem Anisimov
5 hours ago




$begingroup$
Thank you, but I'm not completely sure how "Data" and "FullData" differ. Why do they exist separately?
$endgroup$
– Artem Anisimov
5 hours ago




2




2




$begingroup$
@ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
$endgroup$
– Carl Lange
5 hours ago




$begingroup$
@ArtemAnisimov From the documentation for the "HTML" import/export format: "FullData" imports: "full tabular content, including empty HTML table and list elements". "Data" doesn't import empty elements.
$endgroup$
– Carl Lange
5 hours ago


















draft saved

draft discarded




















































Thanks for contributing an answer to Mathematica Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmathematica.stackexchange.com%2fquestions%2f193199%2fdeleting-missing-values-from-a-dataset%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Loup dans la culture

How to solve the problem of ntp “Unable to contact time server” from KDE?

Connection limited (no internet access)