Setting collection name dynamically based on original directory structure











up vote
1
down vote

favorite












I would like to dyamically add original directory name as
collection when loading files.



Suppose having following file: /home/sampledata/dir1/targetfile.xml
I would like targetfile.xml included in following collections:
"home", "sampledata", "dir1"



Can we do this while importing via MLCP?
or will be appreciate if anyone can share alternative way to achieve by script.










share|improve this question


























    up vote
    1
    down vote

    favorite












    I would like to dyamically add original directory name as
    collection when loading files.



    Suppose having following file: /home/sampledata/dir1/targetfile.xml
    I would like targetfile.xml included in following collections:
    "home", "sampledata", "dir1"



    Can we do this while importing via MLCP?
    or will be appreciate if anyone can share alternative way to achieve by script.










    share|improve this question
























      up vote
      1
      down vote

      favorite









      up vote
      1
      down vote

      favorite











      I would like to dyamically add original directory name as
      collection when loading files.



      Suppose having following file: /home/sampledata/dir1/targetfile.xml
      I would like targetfile.xml included in following collections:
      "home", "sampledata", "dir1"



      Can we do this while importing via MLCP?
      or will be appreciate if anyone can share alternative way to achieve by script.










      share|improve this question













      I would like to dyamically add original directory name as
      collection when loading files.



      Suppose having following file: /home/sampledata/dir1/targetfile.xml
      I would like targetfile.xml included in following collections:
      "home", "sampledata", "dir1"



      Can we do this while importing via MLCP?
      or will be appreciate if anyone can share alternative way to achieve by script.







      marklogic






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 8 at 3:23









      Njbs

      905




      905
























          2 Answers
          2






          active

          oldest

          votes

















          up vote
          1
          down vote



          accepted










          You can use this query:



            let $dir-seprator := "/"
          for $uri in cts:uri-match("*")[fn:ends-with(.,'.xml')][1 to 10]
          let $collection := fn:substring-before($uri,fn:tokenize($uri,$dir-seprator)[fn:last()])
          return
          xdmp:document-set-collections($uri,fn:tokenize($collection,$dir-seprator)[.!='']))





          share|improve this answer





















          • This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
            – grtjn
            Nov 13 at 8:16










          • If someone has bigger data size, can enhance this code to run this in batches.
            – Navin Rawat
            Nov 13 at 9:31


















          up vote
          1
          down vote













          I'd recommend using an MLCP transform. It is most often used to manipulate the content before insert, but you can also adjust uri, collections, and more with it.



          For generic details on MLCP transform, see: https://docs.marklogic.com/guide/mlcp/import#id_82518



          For more specific details on transform output options, see: https://docs.marklogic.com/guide/mlcp/import#id_59764



          HTH!






          share|improve this answer





















            Your Answer






            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "1"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53201106%2fsetting-collection-name-dynamically-based-on-original-directory-structure%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            1
            down vote



            accepted










            You can use this query:



              let $dir-seprator := "/"
            for $uri in cts:uri-match("*")[fn:ends-with(.,'.xml')][1 to 10]
            let $collection := fn:substring-before($uri,fn:tokenize($uri,$dir-seprator)[fn:last()])
            return
            xdmp:document-set-collections($uri,fn:tokenize($collection,$dir-seprator)[.!='']))





            share|improve this answer





















            • This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
              – grtjn
              Nov 13 at 8:16










            • If someone has bigger data size, can enhance this code to run this in batches.
              – Navin Rawat
              Nov 13 at 9:31















            up vote
            1
            down vote



            accepted










            You can use this query:



              let $dir-seprator := "/"
            for $uri in cts:uri-match("*")[fn:ends-with(.,'.xml')][1 to 10]
            let $collection := fn:substring-before($uri,fn:tokenize($uri,$dir-seprator)[fn:last()])
            return
            xdmp:document-set-collections($uri,fn:tokenize($collection,$dir-seprator)[.!='']))





            share|improve this answer





















            • This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
              – grtjn
              Nov 13 at 8:16










            • If someone has bigger data size, can enhance this code to run this in batches.
              – Navin Rawat
              Nov 13 at 9:31













            up vote
            1
            down vote



            accepted







            up vote
            1
            down vote



            accepted






            You can use this query:



              let $dir-seprator := "/"
            for $uri in cts:uri-match("*")[fn:ends-with(.,'.xml')][1 to 10]
            let $collection := fn:substring-before($uri,fn:tokenize($uri,$dir-seprator)[fn:last()])
            return
            xdmp:document-set-collections($uri,fn:tokenize($collection,$dir-seprator)[.!='']))





            share|improve this answer












            You can use this query:



              let $dir-seprator := "/"
            for $uri in cts:uri-match("*")[fn:ends-with(.,'.xml')][1 to 10]
            let $collection := fn:substring-before($uri,fn:tokenize($uri,$dir-seprator)[fn:last()])
            return
            xdmp:document-set-collections($uri,fn:tokenize($collection,$dir-seprator)[.!='']))






            share|improve this answer












            share|improve this answer



            share|improve this answer










            answered Nov 12 at 12:22









            Navin Rawat

            2,75111427




            2,75111427












            • This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
              – grtjn
              Nov 13 at 8:16










            • If someone has bigger data size, can enhance this code to run this in batches.
              – Navin Rawat
              Nov 13 at 9:31


















            • This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
              – grtjn
              Nov 13 at 8:16










            • If someone has bigger data size, can enhance this code to run this in batches.
              – Navin Rawat
              Nov 13 at 9:31
















            This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
            – grtjn
            Nov 13 at 8:16




            This works, but doesn't scale well. Good for small scale (50k docs max) ad hoc adjustments, but less useful in production..
            – grtjn
            Nov 13 at 8:16












            If someone has bigger data size, can enhance this code to run this in batches.
            – Navin Rawat
            Nov 13 at 9:31




            If someone has bigger data size, can enhance this code to run this in batches.
            – Navin Rawat
            Nov 13 at 9:31












            up vote
            1
            down vote













            I'd recommend using an MLCP transform. It is most often used to manipulate the content before insert, but you can also adjust uri, collections, and more with it.



            For generic details on MLCP transform, see: https://docs.marklogic.com/guide/mlcp/import#id_82518



            For more specific details on transform output options, see: https://docs.marklogic.com/guide/mlcp/import#id_59764



            HTH!






            share|improve this answer

























              up vote
              1
              down vote













              I'd recommend using an MLCP transform. It is most often used to manipulate the content before insert, but you can also adjust uri, collections, and more with it.



              For generic details on MLCP transform, see: https://docs.marklogic.com/guide/mlcp/import#id_82518



              For more specific details on transform output options, see: https://docs.marklogic.com/guide/mlcp/import#id_59764



              HTH!






              share|improve this answer























                up vote
                1
                down vote










                up vote
                1
                down vote









                I'd recommend using an MLCP transform. It is most often used to manipulate the content before insert, but you can also adjust uri, collections, and more with it.



                For generic details on MLCP transform, see: https://docs.marklogic.com/guide/mlcp/import#id_82518



                For more specific details on transform output options, see: https://docs.marklogic.com/guide/mlcp/import#id_59764



                HTH!






                share|improve this answer












                I'd recommend using an MLCP transform. It is most often used to manipulate the content before insert, but you can also adjust uri, collections, and more with it.



                For generic details on MLCP transform, see: https://docs.marklogic.com/guide/mlcp/import#id_82518



                For more specific details on transform output options, see: https://docs.marklogic.com/guide/mlcp/import#id_59764



                HTH!







                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Nov 8 at 8:45









                grtjn

                14.7k11730




                14.7k11730






























                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.





                    Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


                    Please pay close attention to the following guidance:


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53201106%2fsetting-collection-name-dynamically-based-on-original-directory-structure%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    這個網誌中的熱門文章

                    Academy of Television Arts & Sciences

                    L'Équipe

                    1995 France bombings