Azure Databricks Jupyter Notebook Python & R in 1 Cell
I have some code (mostly not my original code), that I have running on my local PC anaconda juptyer Notebook environment. I need to scale up the processing so I am looking into Azure Databricks. There's 1 section of code that's running a python loop but utilizes an R library (stats), then passes the data through an R model (tbats). So 1 Jupyter notebook cell runs python & R code. Can this be done in Azure Databricks JNB's as well? I only found documentation that lets you change languages from cell to cell.
In a previous cell I have:
%r libarary(stats) 
So the library is imported (along with other R libs). However when I run the code below, I get "NameError: name 'stats' is not defined. I am wondering if it's the way databricks wants you to tell the cell the language you're using (%r, %python, etc.) 
for customerid, dataForCustomer in original.groupby(by=['customer_id']):
    startYear = dataForCustomer.head(1).iloc[0].yr
    startMonth = dataForCustomer.head(1).iloc[0].mnth
    endYear = dataForCustomer.tail(1).iloc[0].yr
    endMonth = dataForCustomer.tail(1).iloc[0].mnth
    #Creating a time series object
    customerTS = stats.ts(dataForCustomer.usage.astype(int),
                      start=base.c(startYear,startMonth),
                      end=base.c(endYear, endMonth), 
                      frequency=12)
    r.assign('customerTS', customerTS)
    ##Here comes the R code piece
    try:
        seasonal = r('''
                    fit<-tbats(customerTS, seasonal.periods = 12, 
                                    use.parallel = TRUE)
                    fit$seasonal
                 ''')
    except: 
        seasonal = 1
    # APPEND DICTIONARY TO LIST (NOT DATA FRAME)
    df_list.append({'customer_id': customerid, 'seasonal': seasonal})
    print(f' {customerid} | {seasonal} ')
seasonal_output = pa.DataFrame(df_list)
Thank you
python r
 azure jupyter-notebook databricks
azure jupyter-notebook databricks add a comment |
I have some code (mostly not my original code), that I have running on my local PC anaconda juptyer Notebook environment. I need to scale up the processing so I am looking into Azure Databricks. There's 1 section of code that's running a python loop but utilizes an R library (stats), then passes the data through an R model (tbats). So 1 Jupyter notebook cell runs python & R code. Can this be done in Azure Databricks JNB's as well? I only found documentation that lets you change languages from cell to cell.
In a previous cell I have:
%r libarary(stats) 
So the library is imported (along with other R libs). However when I run the code below, I get "NameError: name 'stats' is not defined. I am wondering if it's the way databricks wants you to tell the cell the language you're using (%r, %python, etc.) 
for customerid, dataForCustomer in original.groupby(by=['customer_id']):
    startYear = dataForCustomer.head(1).iloc[0].yr
    startMonth = dataForCustomer.head(1).iloc[0].mnth
    endYear = dataForCustomer.tail(1).iloc[0].yr
    endMonth = dataForCustomer.tail(1).iloc[0].mnth
    #Creating a time series object
    customerTS = stats.ts(dataForCustomer.usage.astype(int),
                      start=base.c(startYear,startMonth),
                      end=base.c(endYear, endMonth), 
                      frequency=12)
    r.assign('customerTS', customerTS)
    ##Here comes the R code piece
    try:
        seasonal = r('''
                    fit<-tbats(customerTS, seasonal.periods = 12, 
                                    use.parallel = TRUE)
                    fit$seasonal
                 ''')
    except: 
        seasonal = 1
    # APPEND DICTIONARY TO LIST (NOT DATA FRAME)
    df_list.append({'customer_id': customerid, 'seasonal': seasonal})
    print(f' {customerid} | {seasonal} ')
seasonal_output = pa.DataFrame(df_list)
Thank you
python r
 azure jupyter-notebook databricks
azure jupyter-notebook databricks add a comment |
I have some code (mostly not my original code), that I have running on my local PC anaconda juptyer Notebook environment. I need to scale up the processing so I am looking into Azure Databricks. There's 1 section of code that's running a python loop but utilizes an R library (stats), then passes the data through an R model (tbats). So 1 Jupyter notebook cell runs python & R code. Can this be done in Azure Databricks JNB's as well? I only found documentation that lets you change languages from cell to cell.
In a previous cell I have:
%r libarary(stats) 
So the library is imported (along with other R libs). However when I run the code below, I get "NameError: name 'stats' is not defined. I am wondering if it's the way databricks wants you to tell the cell the language you're using (%r, %python, etc.) 
for customerid, dataForCustomer in original.groupby(by=['customer_id']):
    startYear = dataForCustomer.head(1).iloc[0].yr
    startMonth = dataForCustomer.head(1).iloc[0].mnth
    endYear = dataForCustomer.tail(1).iloc[0].yr
    endMonth = dataForCustomer.tail(1).iloc[0].mnth
    #Creating a time series object
    customerTS = stats.ts(dataForCustomer.usage.astype(int),
                      start=base.c(startYear,startMonth),
                      end=base.c(endYear, endMonth), 
                      frequency=12)
    r.assign('customerTS', customerTS)
    ##Here comes the R code piece
    try:
        seasonal = r('''
                    fit<-tbats(customerTS, seasonal.periods = 12, 
                                    use.parallel = TRUE)
                    fit$seasonal
                 ''')
    except: 
        seasonal = 1
    # APPEND DICTIONARY TO LIST (NOT DATA FRAME)
    df_list.append({'customer_id': customerid, 'seasonal': seasonal})
    print(f' {customerid} | {seasonal} ')
seasonal_output = pa.DataFrame(df_list)
Thank you
python r
 azure jupyter-notebook databricks
azure jupyter-notebook databricks I have some code (mostly not my original code), that I have running on my local PC anaconda juptyer Notebook environment. I need to scale up the processing so I am looking into Azure Databricks. There's 1 section of code that's running a python loop but utilizes an R library (stats), then passes the data through an R model (tbats). So 1 Jupyter notebook cell runs python & R code. Can this be done in Azure Databricks JNB's as well? I only found documentation that lets you change languages from cell to cell.
In a previous cell I have:
%r libarary(stats) 
So the library is imported (along with other R libs). However when I run the code below, I get "NameError: name 'stats' is not defined. I am wondering if it's the way databricks wants you to tell the cell the language you're using (%r, %python, etc.) 
for customerid, dataForCustomer in original.groupby(by=['customer_id']):
    startYear = dataForCustomer.head(1).iloc[0].yr
    startMonth = dataForCustomer.head(1).iloc[0].mnth
    endYear = dataForCustomer.tail(1).iloc[0].yr
    endMonth = dataForCustomer.tail(1).iloc[0].mnth
    #Creating a time series object
    customerTS = stats.ts(dataForCustomer.usage.astype(int),
                      start=base.c(startYear,startMonth),
                      end=base.c(endYear, endMonth), 
                      frequency=12)
    r.assign('customerTS', customerTS)
    ##Here comes the R code piece
    try:
        seasonal = r('''
                    fit<-tbats(customerTS, seasonal.periods = 12, 
                                    use.parallel = TRUE)
                    fit$seasonal
                 ''')
    except: 
        seasonal = 1
    # APPEND DICTIONARY TO LIST (NOT DATA FRAME)
    df_list.append({'customer_id': customerid, 'seasonal': seasonal})
    print(f' {customerid} | {seasonal} ')
seasonal_output = pa.DataFrame(df_list)
Thank you
python r
 azure jupyter-notebook databricks
azure jupyter-notebook databricks python r
 azure jupyter-notebook databricks
azure jupyter-notebook databricks edited Nov 12 at 6:42


sai saran
344224
344224
asked Nov 11 at 23:24
David Squires
217
217
add a comment |
add a comment |
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53254262%2fazure-databricks-jupyter-notebook-python-r-in-1-cell%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53254262%2fazure-databricks-jupyter-notebook-python-r-in-1-cell%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown